Big Data

 

Big-Data-Blog-ImageThe amount of data is exponentially growing.

To deal with this huge amount of information, and to extract its enormous hidden value, the DBgroup is carrying on research about: data management, data analysis and data accessibility.
 
1. Data Management, i.e., how to handle the huge amount of data: since the volume of the data to be analysed is extremely large, the DBgroup is adopting cutting-edge technologies to manage Big Data (e.g. Apache Hadoop, Apache Spark, NoSQL/NewSQL DBMS).
 
2. Data Analysis, i.e., how to get valuable insight form the data, and how to extract information to drive decision making process. The DBgroup is focused on developing new approach to work in this context and integrated with the systems for Data Management.
 
3. Data Accessibility, i.e., to foster the exchange and integration of data: in the described context, to actually being able to effectively and efficiently retrieve useful datasets for analysis is very challenging, due to the volume and variety of the involved data. The DBgroup is developing solutions to enable that. In particular, the focus of the DBgroup is on how to integrate difference data sources: easily accessible data sets have significantly more value if they can be easily (and automatically) integrated to each other. (This point is related also to the DBgroup research activities in the field of Linked Open Data).


News
Talks

Projects & Collaborations
Publications
Other activities


 

News

 

 

Academy “Metodologie, tecniche e tool per l’analisi dei Big Data”

  • DBgroup Università di Modena & CINECA
    • 50 ore di formazione: 30 ore didattica + 20 ore laboratorio big data con infrastruttura CINECA 
    • 20 postidisponibili
    • 10 borse di studio da 1,500€
    • Iscrizione: 3,000€

 

Assegni a favore delle imprese (previsione maggio 2017 – aprile 2018)

  • BPER: “Big Data e Analytics per lo sviluppo del comportamento digitale del cliente da prospect ad acquisito”
  • DOXEE: “Metodologia di progettazione di applicazioni sui Big Data basata su tecnologia Amazon Web Services“
  • ExpertSystem: “Data Scientist per supportare il processo di produzione di intelligence (Corporate Intelligence Data Scientist)”

Delibera n. 554 del 28/04/2017

 

Talks

 

  • Sonia Bergamaschi invited talk at "Piano Industria 4.0: Linee Operative Per Gli Ingegneri Professionisti": "Tecnologie Esistenti e Sfide. Come l'università affronta queste problematiche. " - 03/02/2018 - slides

  • Sonia Bergamaschi invited talk at "i nuovi eroi", auditorium Ferrari, Maranello (Mo), Italy - 4/10/2017 - slides

  • Sonia Bergamaschi invited talk at Accademia Nazionale di Scienze Lettere e Arti di Modena: "Big Data: opportunità e sfide" - slides

  • Sonia Bergamaschi invited talk at Warrant: "Big Data nell'industria 4.0" - slides

  • Sonia Bergamaschi invited seminar at Paris Descartes University 15/03/2017
     
  • Sonia Bergamaschi at BigDat 2017, Bari, Italy - slides
     
  • Sonia Bergamaschi is panelist member on the session "big data" at SEBD 2016 - link

  • Sonia Bergamaschi invited speaker at the Workshop "PICO: the CINECA solution for Big Data management" @ headquarters of Casalecchio on December 5th 2014. The agenda of the workshop is published at this URLhttp://www.hpc.cineca.it/news/workshop-pico-cineca-solutions-big-data-scienceThe event will be streamed at : http://streaming.cineca.it/pico/

 


  • Professor Sonia Bergamaschi invited speaker at BDAA 2014 - lecture title  " Big Data Analysis: Trends & Challenges" [IEEE Proceedings of  the International Conference on High Performance Computing & Simulation (HPCS 2014), pag. 303 - 304SLIDES


Projects & Collaborations

slides g
  • Big Data Exploration with Faceted Browsing [6]:
slides g2

Publications

[14] Simonini, G., Gagliardelli L., Bergamaschi, S., & Jagadish, H. V. (2019). Scaling Entity Resolution: A Loosely Schema-aware Approach. Information Systems.
[13] Gagliardelli, L., Simonini, G., Beneventano D., & Bergamaschi S. (2019). SparkER: Scaling Entity Resolution in Spark. In 22nd International Conference on Extending Database Technology (EDBT 2019). poster
[12] 
Simonini, G., Papadakis, G., Palpanas, T., & Bergamaschi, S. (2018). Schema-agnostic Progressive Entity Resolution. IEEE Transactions on Knowledge and Data Engineering.

[11] S. Bergamaschi, G. Fiameni, G. Simonini, Z. Song “SOPJ: A Scalable Online Provenance Join for Data Integration”, IEEE International Conference on High Performance Computing & Simulation, HPCS 2017.

[10] S. Bergamaschi, L. Gagliardelli, G. Simonini, S. Zhu "BigBench workload executed by using Apache Flink", 27th International Conference on Flexible Automation and Intelligent Manufacturing, Modena 2017

[9] G. Simonini, S. Bergamaschi, H.V. Jagadish "Blast: a Loosely schema-aware Meta-blocking Approach for Entity Resolution", VLDB 2016.

[8] S. Bergamaschi et al.: ”Big Data Research in Italy: a perspective ", Enginering Journal (2), 2016.

[7] S. Bergamaschi G. Simonini, S. Zhu: "Enhancing Big Data Exploration with Faceted Browsing",
Classification, (Big) Data Analysis and Statistical Learning 2017.

[6] S. Bergamaschi G. Simonini, S. Zhu: "Enhancing Big Data Exploration with Faceted Browsing", 10th Scientific Meeting of Classification and Analysis Group (CLADAG 2015).

[5] G. Simonini, S. Zhu: "Big data exploration with faceted browsing". In IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2015), Special Session on Big Data Principles, Architectures & Applications, Amsterdam, 20-24 July 2015.

[4] S. Bergamaschi, D. Ferrari, F. Guerra, G. Simonini, Y. Velegrakis: "Providing insight into data source topics." Journal on Data Semantics 5.4 (2016): 211-228.

[3] S. Bergamaschi, D. Ferrari, F. Guerra, G. Simonini: "Discovering the topics of a data source: a statistical approach" - SWSD Workshop @ISWC 2014.

[2] M. Interlandi, G. Simonini: "Towards Declarative Imperative Data-parallel Systems”, 22nd Italian Symposium on Advanced Database Systems, SEBD 2014.

[1] G. Simonini, F. Guerra: "Using big data to support automatic Word Sense Disambiguation”, IEEE International Conference on High Performance Computing & Simulation, HPCS 2014.



Other activities


2017

 

  • April 2017
    • DBgroup hold the course "Big Data Tools" for the "ordine degli ingegneri" (Italian Engineering Association)

 

2016

 


2015

  • November 2015
    • Sonia Bergamaschi will be member of the insight review panel of the SFI Research Centre: INSIGHT-Irelands Big Data and Analytics Centre in the National University of Ireland, Galway on 25-27 November 2015.
  • October 2015
    • Sonia Bergamaschi is speaker at  the conference CLADAG 2015 (8-10 October).
    • Song Zhu is teacher of the course Toolsandtechniquesformassivedataanalysis promoted by Cineca on 14-15-16 October
  • July 2015
    • Francesco Guerra e Sonia Bergamaschi are track organizers of the Second International Workshop " Big Data Principles, Architecture & Applicationds (BDAA) 2015" as part of the International Conference on High Performance Computing & Simulation (HPCS 2015): http://hpcs2015.cisedu.info/2-conference/hpsc-2015-symposia/bdaa
    • G. Simonini, S. Zhu: "Big data exploration with faceted browsing". In IEEE Proceedings of  the International Conference on High Performance Computing & Simulation (HPCS 2015), Special Session on Big Data Principles, Architectures & Applications,  Amsterdam, 20-24 July 2015.
  • April 2015
    • Sonia Bergamaschi and Giovanni Simonini are teachers of the "Emerging Tools and techniques for massive data analysis" promoted by Cineca on 08-09-10 April
       
2014

 


  • July 2014 
    • Professor Sonia Bergamaschi is Session Chair at the BDAA 2014Special Session on Big Data Principles, Architectures & Applications; as part of The International Conference on High Performance Computing & Simulation (HPCS 2014and  panelist at HPCS 2014 "New Opportunities in High Performance Data Analytics (HPDA) and High Performance Computing (HPC) 
      [IEEE Proceedings of  the International Conference on High Performance Computing & Simulation (HPCS 2014), pag. lxiii - lxv]
    • Professor Sonia Bergamaschi is invited speaker at BDAA 2014 - lecture title  " Big Data Analysis: Trends & Challenges" [IEEE Proceedings of  the International Conference on High Performance Computing & Simulation (HPCS 2014), pag. 303 - 304SLIDES
    • G. Simonini, F.Guerra: "Using Big Data to Support Automatic Word Sense Disambiguation". In IEEE Proceedings of  the International Conference on High Performance Computing & Simulation (HPCS 2014), Special Session on Big Data Principles, Architectures & Applications,  Bologna, 21-25 July 2014." 

 

2013
  • May 2013
    • The DBGROUP contributed to the  whitepaper “UNLEASHING THE POTENTIAL OF BIG DATA” (link)
      The Whitepaper “UNLEASHING THE POTENTIAL OF BIG DATA” (link) is based on the 2013 World Summit on Big Data and Organization Design, initiated by the Organizational Design Community (ODC) and co-sponsored by IBM. 

Copyright @  2019   DataBase Group for suggestions write to  Webmaster