Big data means really a big data, it is a collection of large data sets that cannot be processed using traditional computing techniques. Big data is not merely a data, rather it has become a complete subject, which involves various tools, techniques and frameworks.
Big data involves the data produced by different devices and applications.
Black Box Data
Social Media Data
Stock Exchange Data
Power Grid Data
Transport Data
Search Engine Data
Benefits of Big Data
Big data is really critical to our life and its emerging as one of the most important technologies in modern world. Follow are just few benefits which are very much known to all of us:
Using the information kept in the social network like Facebook, the marketing agencies are learning about the response for their campaigns, promotions, and other advertising mediums.
Using the information in the social media like preferences and product perception of their consumers, product companies and retail organizations are planning their production.
Using the data regarding the previous medical history of patients, hospitals are providing better and quick service.
Hadoop
Hadoop is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the Map Reduce programming model. Originally designed for computer clusters built from hardware still the common use it has also found use on clusters of higher-end hardware. All the modules in Hadoop are designed with a fundamental assumption that hardware failures are common occurrences and should be automatically handled by the framework.
Applications
log and/or click stream analysis of various kinds
marketing analytics
machine learning and/or sophisticated data mining
image processing
processing of XML messages
web crawling and/or text processing
general archiving, including of relational/tabular data, e.g. for compliance
Certification
Exam CCA175 Cloudera Spark and Hadoop Developer Certification
Course Curriculum
Big data | |||
Big Data Opportunities & Challenges | 00:00:00 | ||
OOPS & Java Fundamentals | 00:00:00 | ||
Understanding Linux Commands | 00:00:00 | ||
Introduction to Hadoop | 00:00:00 | ||
Getting Started with Hadoop | 00:00:00 | ||
Pseudo Cluster Environment – Setting up Hadoop Cluster | 00:00:00 | ||
MapReduce | 00:00:00 | ||
Installing PIG, HIVE, HBASE, SQOOP,ZooKeeper, Oozie | 00:00:00 | ||
Hadoop Admin | 00:00:00 |