Distribuera mera - Spark och Hadoop utan Big Data
Distribution as a concept means that a task (for example, data storage or code execution) is parallelized on multiple computers. It goes hand in hand with the concept of big data – extreme amounts of data that can’t be processed by a single computer. Because of this, the most established tools for distributed parallelization is tools that are designed to handle big data. This thesis explores wheth