Today, competition has led companies to identify their ability to make informed decisions as a key to success. Technology has played a very important role in this regard, enabling us to record the operational data of the company's daily activities and to analyze it afterwards. These types of systems are known as decisional systems, and data warehouses are the most widely used architecture for implementing them. However, thanks in part to the success of this type of system, the paradigm of analysis is changing, and a new type of data is being created, which is no longer only associated with the company's day to day activities, but also considers its "environment": social networks, logs, open data, etc.
The requirements of these new data types differ from those of the old ones, and have shown the limitations of traditional architectural solutions. For all these reasons, the term Big Data is currently used to refer to this new type of system and the challenges they entail. The most popular definition of the term Big Data is based on the three Vs, which represent its three main challenges: volume (large volumes of data), variety (heterogeneous data sources) and velocity (referring to processing and response times).
To address these three major challenges, Big Data today is based on the principle of "divide-and-conquer" according to which the problems have been formalized into sub-problems that can be executed simultaneously. For this reason, most Big Data solutions are based on "Cloud Computing" and on developing distributed systems in the cloud.
Big Data systems are computer systems that are based on similar design plans to all the others. We can therefore talk about the management of data in Big Data systems (Big Data Management) and using these data to extract knowledge relevant to the organization with Data Mining and Machine Learning algorithms (Big Data Analytics). Unlike traditional systems, however, there is not so much justification to separate the data exploitation management part, as there is no universal solution for storing data and exploiting them in a Big Data environment. Instead, the architectural solution depends on the specific case of use (exploitation) being considered.
This Postgraduate programme provides an overview of Big Data ecosystem and considers both aspects in depth: management (Big Data Management) and exploitation of data (Big Data Analytics), while providing applicability and a business vision within this system.