Introduction and Overview
Quantifying the amount of digital information that exists in the world is hard. What is clear is that there is an awful lot of it, and it is growing at a terrific rate. Modern businesses are no exception: they accumulate an astonishing amount of digital data, which can be leveraged to unlock new sources of economic value or to provide fresh insights into business trends. The real challenge lies in designing the computing and storage infrastructures and the algorithms needed to handle this “big data” problem. Because the Web is the largest collection of digital data, internet companies have contributed several exceptional technologies for handling this problem efficiently.
This course introduces participants to the new generation of technologies developed to handle the “big data” challenge. It gives exposure to a distributed computing and storage paradigm designed to solve big data problems. The course also discusses some algorithmic approaches employed by organizations dealing with big data problems.
Participants should preferably have at least 2 years of experience in software
development in a Java/Unix/Linux environment and a good understanding of data/business analytics.