Apache Pig and Hive Installation Single Node Machine
  • By Sachin Patil
  • September 27, 2019
  • Big DataHadoop

Apache Pig and Hive Installation Single Node Machine The Apache Hadoop software library is a framework that allows the data distributed processing across clusters for computing using simple programming models…

R Visualizations – ggplot2 (PART-2)
  • By Rahul Pund
  • September 25, 2019
  • Big Data

R Visualizations- Part 2 R Visualizations - ggplot2  (PART-2)   Distribution Study of how and where data points are distributed is very important in large amount of data. Histogram Histogram…

R Visualizations – ggplot2
  • By Rahul Pund
  • August 28, 2019
  • Big Data

R Visualizations - ggplot2  (PART-1) Type of visualization using ggplot2 and their implementations using R-language: . There are 8 different categories of models you may construct plots.  A) Correlation:- Scatterplot,…

R Hadoop – A perfect model for Big Data
  • By Harshal Patil
  • August 24, 2019
  • Big DataHadoop

Introducing R with Hadoop R is an open-source software system package to perform applied math analysis on knowledge. R is a programming language used by data scientist statisticians and others…

Hadoop Map Reduce Programs for Word Count with Steps
  • By Sachin Patil
  • July 22, 2019
  • Big DataHadoop

Hadoop Map Reduce Programs for Word Count with Steps Introduction: Hadoop is an open source software framework designed for storage and processing of large scale variety of data on clusters…

Need of Apache Spark for Big Data Processing
  • By Sachin Patil
  • June 15, 2019
  • Big Data

Need of Apache Spark for Big Data Processing What is Spark - Apache Spark is an open source framework for big data processing which is built for speed, easy use, and…