Big Data
- Details
- Last Updated on Wednesday, 16 November 2016 12:44

The amount of data is exponentially growing.
To deal with this huge amount of information, and to extract its enormous hidden value, the DBgroup is carrying on research about: data management, data analysis and data accessibility.
1. Data Management, i.e., how to handle the huge amount of data: since the volume of the data to be analysed is extremely large, the DBgroup is adopting cutting-edge technologies to manage Big Data (e.g. Apache Hadoop, Apache Spark, NoSQL/NewSQL DBMS).
2. Data Analysis, i.e., how to get valuable insight form the data, and how to extract information to drive decision making process: given the huge amount of involved data, traditional techniques for machine learning and, more generally, data analysis on “small” data are no longer applicable. Hence, the DBgroup is focused on developing new approach to work in this context and integrated with the systems for Data Management.
3. Data Accessibility, i.e., to foster the exchange and integration of data: in the described context, to actually being able to effectively and efficiently retrieve useful datasets for analysis is very challenging, due to the volume and variety of the involved data. The DBgroup is developing solutions to enable that. In particular, the focus of the DBgroup is on how to integrate difference data sources: easily accessible data sets have significantly more value if they can be easily (and automatically) integrated to each other. (This point is related also to the DBgroup research activities in the field of Linked Open Data).
News
- DBGroup research on Big Data, presentation - slides
- DBgroup will hold a course on "Big Data Analytics" in collaboration with CINECA from September 19th to September 22nd 2016.
News - Program
- The paper "Blast: a Loosely schema-aware Meta-blocking Approach for Entity Resolution", by Giovanni Simonini, Sonia Bergamaschi and H.V. Jagadish, accepted at VLDB 2016. PDF
- Big Data in Emilia. Articolo de "Il Sole 24 Ore" sul polo dei Big Data in Emila-Romagna. link
Talks
- Sonia Bergamaschi is panelist member on the session "big data" at SEBD 2016 link
- Sonia Bergamaschi invited speaker at the Workshop "PICO: the CINECA solution for Big Data management" @ headquarters of Casalecchio on December 5th 2014. The agenda of the workshop is published at this URL: http://www.hpc.cineca.it/news/workshop-pico-cineca-solutions-big-data-science. The event will be streamed at : http://streaming.cineca.it/pico/
- Professor Sonia Bergamaschi invited speaker at IC3K 2014 (http://www.ic3k.org/KeynoteSpeakers.aspx) - lecture title"Big Data integration - State of the Art &Challenges" - Roma 21-24 October 2014. Link to the program. SLIDES (PDF)
- Professor Sonia Bergamaschi invited speaker at BDAA 2014 - lecture title " Big Data Analysis: Trends & Challenges" [IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2014), pag. 303 - 304] SLIDES
Ongoing Projects & Collaborations
- Member of the CINI Big Data Lab - pdf
- Laboratorio Big Data dell'Università degli Studi di Modena e Reggio Emilia, Dipartimento di Ingegneria "Enzo Ferrari"
-
Courses "Tools and techniques for massive data analysis" promoted by Cineca
-
Entity Resolution for Big Data Integration:

- Big Data Exploration with Faceted Browsing:

Publications
- G. Simonini, S. Bergamaschi, H.V. Jagadish "Blast: a Loosely schema-aware Meta-blocking Approach for Entity Resolution", VLDB 2016 - pdf
- S. Bergamaschi et al. "Big Data Research in Italy: A Perspective" Engineering (Journal) - pdf
- G. Simonini and S. Bergamaschi: "Enhancing Entity Resolution Efficiency with Loosely Schema-aware Techniques", SEBD 2016
- S. Bergamaschi G. Simonini, S. Zhu: "Enhancing Big Data Exploration with Faceted Browsing", 10th Scientific Meeting of Classification and Analysis Group (CALDAG 2015) - link
- G. Simonini, S. Zhu: "Big data exploration with faceted browsing". In IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2015), Special Session on Big Data Principles, Architectures & Applications, Amsterdam, 20-24 July 2015.
- S. Bergamaschi, F. Guerra, G. Simonini: "Discovering the topics of a data source: a statistical approach" - SWSD Workshop @ISWC 2014
- M. Interlandi, G. Simonini: "Towards Declarative Imperative Data-parallel Systems" - 22nd Italian Symposium on Advanced Database Systems, SEBD 2014
- G. Simonini, F. Guerra: "Using big data to support automatic Word Sense Disambiguation" - IEEE International Conference on High Performance Computing & Simulation, HPCS 2014
Conference partecipations and other activities
2016
- March 2016
- Francesco Guerra will speak at the talk entitled: "Big Data: dal caos dei dati all'incremento del proprio business", Modena on 13 March 2016
2015
- November 2015
- Sonia Bergamaschi will be member of the insight review panel of the SFI Research Centre: INSIGHT-Irelands Big Data and Analytics Centre in the National University of Ireland, Galway on 25-27 November 2015.
- October 2015
- Sonia Bergamaschi is speaker at the conference CLADAG 2015 (8-10 October).
- Song Zhu is teacher of the course Toolsandtechniquesformassivedataanalysis promoted by Cineca on 14-15-16 October
- July 2015
- Francesco Guerra e Sonia Bergamaschi are track organizers of the Second International Workshop " Big Data Principles, Architecture & Applicationds (BDAA) 2015" as part of the International Conference on High Performance Computing & Simulation (HPCS 2015): http://hpcs2015.cisedu.info/2-conference/hpsc-2015-symposia/bdaa
- G. Simonini, S. Zhu: "Big data exploration with faceted browsing". In IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2015), Special Session on Big Data Principles, Architectures & Applications, Amsterdam, 20-24 July 2015.
- Francesco Guerra e Sonia Bergamaschi are track organizers of the Second International Workshop " Big Data Principles, Architecture & Applicationds (BDAA) 2015" as part of the International Conference on High Performance Computing & Simulation (HPCS 2015): http://hpcs2015.cisedu.info/2-conference/hpsc-2015-symposia/bdaa
- April 2015
- Sonia Bergamaschi and Giovanni Simonini are teachers of the "Emerging Tools and techniques for massive data analysis" promoted by Cineca on 08-09-10 April
- Sonia Bergamaschi and Giovanni Simonini are teachers of the "Emerging Tools and techniques for massive data analysis" promoted by Cineca on 08-09-10 April
2014
- December 2014
- Sonia Bergamaschi and Giovanni Simonini are teachers of the "Emerging Tools and techniques for massive data analysis" promoted by Cineca on 15-16 december, see the agenda and link
- Sonia Bergamaschi is invited speaker at the Workshop "PICO: the CINECA solution for Big Data management" that will take place at the CINECA headquarters of Casalecchio on December 5th. The agenda of the workshop is published at this URL: http://www.hpc.cineca.it/news/workshop-pico-cineca-solutions-big-data-science. The event will be streamed at : http://streaming.cineca.it/pico/
- Sonia Bergamaschi attended as panelist the Ai*IA workshop "Embracing Potential of Big Data" in Pisa on 12/12/2014. See the agenda at: http://aiia2014.di.unipi.it/bigdata/index
- October 2014
- Professor Sonia Bergamaschi is invited speaker at IC3K 2014 (http://www.ic3k.org/KeynoteSpeakers.aspx) - lecture title"Big Data integration - State of the Art &Challenges" - Roma 21-24 October 2014. Link to the program. SLIDES (PDF)
- July 2014
- Professor Sonia Bergamaschi is Session Chair at the BDAA 2014: Special Session on Big Data Principles, Architectures & Applications; as part of The International Conference on High Performance Computing & Simulation (HPCS 2014) and panelist at HPCS 2014 "New Opportunities in High Performance Data Analytics (HPDA) and High Performance Computing (HPC)
[IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2014), pag. lxiii - lxv] - Professor Sonia Bergamaschi is invited speaker at BDAA 2014 - lecture title " Big Data Analysis: Trends & Challenges" [IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2014), pag. 303 - 304] SLIDES
- G. Simonini, F.Guerra: "Using Big Data to Support Automatic Word Sense Disambiguation". In IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2014), Special Session on Big Data Principles, Architectures & Applications, Bologna, 21-25 July 2014."
- Professor Sonia Bergamaschi is Session Chair at the BDAA 2014: Special Session on Big Data Principles, Architectures & Applications; as part of The International Conference on High Performance Computing & Simulation (HPCS 2014) and panelist at HPCS 2014 "New Opportunities in High Performance Data Analytics (HPDA) and High Performance Computing (HPC)
-
March 2014
-
article on Sole 24 ORE DBGroup and Big Data
-
2013
- May 2013

