Читать книгу Big Data - Seifedine Kadry - Страница 16
1.3.1 Data Mining vs. Big Data
ОглавлениеTable 1.2 shows a comparison between data mining and big data.
Table 1.1 Differences in the attributes of big data and RDBMS.
ATTRIBUTES | RDBMS | BIG DATA |
---|---|---|
Data volume | gigabytes to terabytes | petabytes to zettabytes |
Organization | centralized | distributed |
Data type | structured | unstructured and semi‐structured |
Hardware type | high‐end model | commodity hardware |
Updates | read/write many times | write once, read many times |
Schema | static | dynamic |
Table 1.2 Data Mining vs. Big Data.
S. No. | Data mining | Big data |
---|---|---|
1) | Data mining is the process of discovering the underlying knowledge from the data sets. | Big data refers to massive volume of data characterized by volume, velocity, and variety. |
2) | Structured data retrieved from spread sheets, relational databases, etc. | Structured, unstructured, or semi‐structured data retrieved from non‐relational databases, such as NoSQl. |
3) | Data mining is capable of processing large data sets, but the data processing costs are high. | Big data tools and technologies are capable of storing and processing large volumes of data at a comparatively lower cost. |
4) | Data mining can process only data sets that range from gigabytes to terabytes. | Big data technology is capable of storing and processing data that range from petabytes to zettabytes. |