Articles by Diego Guenzi
Architetture scalabili per memorizzazione, analisi, condivisione e pubblicazione di grosse moli di dati
Abstract
In this article we discuss a possible architecture to store, analyze and share large quantities of data (also known as Big Data). This data can be structured or not and can be generated from different sources: mobile devices, servers or, in general, sensors. This infrastructure can be integrated with analysis software like R, Solr and many others; it is developed with the cloud computing and its principles in mind so it can scale out easily and is high available. It is based on the Hadoop framework and, in particular, on the HDFS - Hbase store mechanisms (a distributed file system and a NOSQL database). Those repositories are exported with Fuse and are published with a typical Apache HTTP server with the addition of WebDav module. Fault tolerance and load balancing is achieved with HAProxy and Keepalived. All those software are free and open source (FOSS), can be installed on commodity hardware and are constantly updated by a large community of users.
This website uses only technical cookies strictly necessary for its proper functioning. It doesn't perform any profiling and doesn't use third party cookies of any kind.
Read our privacy policy for additional information.
By clicking 'OK' or closing this banner you acknowledge having read this information and accept the website's contents.