Every organization, who’s facing challenges with growing colossal amount of information which is coming as streaming data- is now uses distributed, parallel and batch processing framework such as Hadoop.
Statistical Analysis of one minute internet search.
- Google receives over 2,000,000 search queries.
- Facebook receive 34,700 “likes”.
- Consumers spend $272,070 on web shopping.
- Apple receives around 47,00 app download requests for every 1 minute.
- 37,000 minutes skype calls on every minute internet.
- Twitter receives 98,000 tweets for every one minute.
- 20,000 posts per minute on blogging platform Tumblr
- 13,000 hours of music streaming on Pandora.
- 6,600 pictures uploaded every minute to Flickr.
- Around 600 new videos are uploaded to every minute in YouTube.
Information Data Corporation (IDC) estimates data created in 2010 to be 1.2 Zetta bytes (1.2 trillion Giga bytes). Companies continue to generate large amounts of data. Here are some 2011 stats.
- 6 billion messages generated per day on Facebook.
- 2 billion page views a day on eBay will use 9 Petabytes of storage.
- Google process 20 Peta bytes a day.
- Facebook has 2.5 Petabytes of user data.
- eBay has 6.5 Petabytes of user data.
More common business domains where Hadoop is used.
- Retail / eCommerce
- Security Surveillance
- Traffic Control
- Log Analysis
- Web Search / Web crawlers
- Trading Analysis
- Fraud / Risk
Users of Hadoop
All well known companies who are flooded with streaming data daily are using distributed parallel processing framework, Hadoop. To name some of them who are users of hadoop are Facebook,Yahoo, LinkedIn, Twitter, eBay, Adobe, Amazon etc.,