This chapter introduces heterogeneity as a perspective on the architecture of big data systems targeting both vertical and generic workloads, and discusses how it can be linked with the existing Hadoop ecosystem (as of 2015). The cost of a big data solution, together with its characteristics, can influence its architectural patterns and capabilities; accordingly, an extended model based on the 3V paradigm (Extended 3V) is introduced. This is examined across a hierarchical set of four layers (Hardware, Management, Platform and Application). For each layer, a list of components is provided, along with a classification of their role in a big data solution.
|Title of host publication||Managing and Processing Big Data in Cloud Computing|
|Editors||Rajkumar Kannan, Raihan Ur Rasool, Hai Jin, S.R. Balasundaram|
|Publisher||Idea Group Publishing|
|Number of pages||28|
|Publication status||Published - Jan 2016|
|Name||Advances in Data Mining and Database Management (ADMDM)|