News

This paper provides a high-level overview of how Apache Cassandra™ can be used to replace HDFS, with no programming changes required from a developer perspective, and how a number of compelling ...
Big data can mean big threats to security, but BlueTalon just launched what it calls the first-ever filtering and dynamic masking capabilities for use directly on the Hadoop Distributed File ...
The good news is Hadoop is one of the most cost-effective ways to store huge amounts of data. You can store all types of structured, semi-structured, and unstructured data within the Hadoop Distributed ...
Quantcast, an internet audience measurement and ad targeting service, processes over 20 petabytes of data per day using Apache Hadoop and its own custom file system called Quantcast File System ...
Doug Cutting, creator of the distributed computing platform Hadoop, on why the platform is in an almost unassailable position and what's in store for it.
SAP is using the Hadoop distro vendor MapR's file system in its cloud storage layer, and not just for Hadoop/Big Data.
Several distributed file systems are used over the cloud because the cloud itself includes large numbers of commodity-grade servers, harnessed to deliver highly scalable and on-demand services.
At its core, we have MapReduce, YARN and the Hadoop Distributed File System, but the number of peripheral Apache projects that complement Hadoop -- including Ambari, Chukwa, Avro, HBase and Mahout -- ...