The term "big data" has been in use for over 25 years. It commonly refers to data sets whose size is beyond the ability of commonly used software tools to capture, curate, manage, and process within a tolerable elapsed time. What counts as "big" is a constantly moving target, ranging from a few terabytes to many petabytes of data. Big data technologies and techniques are used to reveal insights from datasets that are diverse, complex, and of massive scale.
There are primarily two classes of big data technology: operational and analytical. Operational systems handle real-time, interactive workloads where data is primarily captured and stored. Analytical systems provide capabilities for complex retrospective analysis that may touch most or all of the data. The two classes are complementary and are deployed together.
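The difference between the two classes can be illustrated with a minimal Python sketch. It is a toy in-memory stand-in, not how Hadoop, MongoDB, or any real system works: the class `OperationalStore`, the function `revenue_by_region`, and the order records are all hypothetical names and data invented for illustration. The operational side reads and writes one record at a time by key, while the analytical side scans every record to compute a summary.

```python
from collections import defaultdict


class OperationalStore:
    """Hypothetical keyed store: captures and serves single records."""

    def __init__(self):
        self._records = {}

    def capture(self, order_id, record):
        # Operational write: touches exactly one record.
        self._records[order_id] = record

    def lookup(self, order_id):
        # Operational read: a point lookup by key.
        return self._records.get(order_id)

    def snapshot(self):
        # Hands the full data set to the analytical side.
        return list(self._records.values())


def revenue_by_region(records):
    """Analytical query: scans all records and aggregates them."""
    totals = defaultdict(float)
    for record in records:
        totals[record["region"]] += record["amount"]
    return dict(totals)


store = OperationalStore()
store.capture(1, {"region": "EU", "amount": 20.0})
store.capture(2, {"region": "US", "amount": 35.5})
store.capture(3, {"region": "EU", "amount": 4.5})

print(store.lookup(2))                       # one record, by key
print(revenue_by_region(store.snapshot()))   # summary over all records
```

In a real deployment the two sides would usually be separate systems, with the operational store feeding the analytical one, but the access patterns are the same: point reads and writes on one side, full scans on the other.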
Many technologies deal with big data; among them, Hadoop and MongoDB are mature and proven. A few clear criteria help in selecting an appropriate big data technology:
|Online vs. Offline Big Data|General Purpose vs. Niche Solutions|Developer Appeal|
|Software Licensing Models|Community|Agility|
Big data technologies support search, development, governance, and analytics services for all types of data, from transaction and application data to machine and sensor data to social, image, and geospatial data, and more.