Fundamentals Data Science

Data Science

In our research on Intelligent Systems and Data Science, we use semantic technologies to model and reason with large amounts of data, as well as machine learning and statistical techniques for mining knowledge from data in a variety of application domains. These include (but are not limited to) Smart Cities, Scholarly Data and Digital Humanities.

Furthermore, our interest lies on understanding what are the fundamental features of a “Science of Data”. Here, we focus on the way different types of data can be described to support their discovery, reuse and governance in complex processing infrastructures. In particular, in the context of developing intelligent metadata management applications, we have developed methods that can reason with semantic representations of policies and data flows to determine how licences and policies propagate upon complex data flows – see http://oro.open.ac.uk/52707/.

We are currently also working on developing new solutions for data cataloguing, aiming at improving the way such important assets are stored and managed as ‘libraries of data’. This work supports the data cataloguing infrastructure developed for the MK Data Hub, but it is of course, also applicable to other contexts.

To support Data Science research and development, the ISDS team manages the KMi Big Data Cluster, a processing infrastructure based on Apache Hadoop.

The ISDS team also develops and maintains the Open Knowledge Graph of The Open University and promotes an open approach to the dissemination and reuse of research outputs. Initiated in 2010, this was the first Open Knowledge Graph in UK academia leveraging Linked Data technologies in the education domain.

Follow us on Twitter

Team

Alessio Antoninis PhotoAlessio AntoniniResearch Associate
Jason Carvalhos PhotoJason CarvalhoProject Officer - Data Hub Development
Enrico Dagas PhotoEnrico DagaResearch Fellow
Alba Morales Tirados PhotoAlba Morales TiradoPhD Research Student
Enrico Mottas PhotoEnrico MottaProfessor of Knowledge Technologies

News

SPIROCCO’s App for community-based care

SPIROCCO’s App for community-based care

As part of the H2020 GATEKEEPER, we are developing an App with the SME SPIROCCO to support community-based care. SPIROCCO is a winner of €50k funds of the 1st GK Open Call, addressing our challenge...

Agnese presents her research on robots with common sense at the AAAI Spring Symposium

The 2022 AAAI Symposium on Machine Learning and Knowledge Engineering for Hybrid Intelligence (AAAI-MAKE) took place on March 21-23. The symposium is aimed at bringing together researchers and...

SPARQL Anything at the meeting of the Knowledge Graph Construction W3C community group

SPARQL Anything at the meeting of the Knowledge Graph Construction W3C community group

On 14 February, Enrico Daga joined the biweekly meetings of the Knowledge Graph Construction community group to give a presentation on SPARQL Anything. W3C is a community group that aims to support...

Testing the GATEKEEPER robotic intervention in KMi

Testing the GATEKEEPER robotic intervention in KMi

As part of the GATEKEEPER project, our Gianluca is working on autonomous inspection of home environments aimed to identify and map objects of daily living and identify potential hazards. In...

Testing Robotic Vision through VR

Testing Robotic Vision through VR

As a part of the GATEKEEPER project, we are collaborating with Extend Robotics on the development of VR software for remote presence through the TiaGo robot. The first step is the reconstruction of...

The Semantic Web Community Celebrates its 20th Birthday

The Semantic Web Community Celebrates its 20th Birthday

This year marks the 20th anniversary of the ‘official’ birth of the Semantic Web community, as 2001 saw both the publication of the landmark paper on the Semantic Web by Tim Berners-Lee, Jim...

Watch the video from the 2021 SciRoc Challenge!

Watch the video from the 2021 SciRoc Challenge!

Well… what an event! A huge thank you again to all of our teams, sponsors, volunteers and members of the public for joining us to support the 2021 SciRoc Challenge, and to the city of Bologna for...

Event news: The Future and the Past of Reading (18.11.2021), Milano, Italy

Event news: The Future and the Past of Reading (18.11.2021), Milano, Italy

READ-IT members will take part in a workshop, ‘The Future and the Past of Reading. New Research Methods for new Perspectives’ at the Università degli Studi di Milano, Italy, on Thursday 18...

The Open University’s project period ends

The Open University’s project period ends

On 30 September 2021, The Open University’s funded period on the project READ-IT comes to an end. While the project team Dr. Shafquat Towheed (UK Lead), Dr. Alessio Antonini, Dr. Francesca Benatti,...

Publications

2022

Reyero Lobo, P., Daga, E. and Alani, H. (2022) Supporting Online Toxicity Detection with Knowledge Graphs, Sixteenth International AAAI Conference on Web and Social Media, Atlanta, Georgia


Reyero Lobo, P., Mensio, M., Pavon Perez, A., Bayer, V., Kwarteng, J., Fernandez, M., Daga, E. and Alani, H. (2022) Estimating Ground Truth in a Low-labelled Data Regime: A Study of Racism Detection in Spanish, Workshop on Novel Evaluation Approaches for Text Classification Systems on Social Media (NEATCLasS), Atlanta, Georgia


Allocca, C., Jilali, S., Ail, R., Lee, J., Kim, B., Antonini, A., Motta, E., Schellong, J., Stieler, L., Haleem, M., Georga, E., Pecchia, L., Gaeta, E. and Fico, G. (2022) Toward a Symbolic AI Approach to the WHO/ACSM Physical Activity Sedentary Behavior Guideline, Applied Sciences, 12, MDPI


Chiatti, A., Motta, E. and Daga, E. (2022) Robots with Commonsense: Improving Object Recognition through Size and Spatial Awareness, Workshop: the 2022 AAAI Spring Symposium on Machine Learning and Knowledge Engineering for Hybrid Intelligence (AAAI-MAKE 2022), CEUR


Bardaro, G., Daga, E., Carvalho, J., Chiatti, A. and Motta, E. (2022) Introducing a Smart City Component in a Robotic Competition: A Field Report, 9, Frontiers Media S.A.


Morales Tirado, A., Daga, E. and Motta, E. (2022) CONRAD - Health Condition Radar: an Intelligent System for Emergency Support, Proceedings of 5th Workshop on Semantic Web solutions for large-scale biomedical data analytics co-event with The ESWC 2022: Extended Semantic Web Conference, Hersonissos, Greece


Morales Tirado, A., Daga, E. and Motta, E. (2022) HECON: Health Condition Evolution Ontology, Proceedings of 5th Workshop on Semantic Web solutions for large-scale biomedical data analytics co-event with The ESWC 2022: Extended Semantic Web Conference, Hersonissos, Greece


2021

Morales Tirado, A., Daga, E. and Motta, E. (2021) Reasoning on health condition evolution for enhanced detection of vulnerable people in emergency settings, K-CAP '21: Proceedings of the 11th on Knowledge Capture Conference, Virtual Event, USA


Daga, E., Asprino, L., Mulholland, P. and Gangemi, A. (2021) Facade-X: An Opinionated Approach to SPARQL Anything Volume 53: Further with Knowledge Graphs, eds. Mehwish Alam,Paul Groth,Victor de Boer,Tassilo Pellegrini,Harshvardhan J. Pandit, 53, pp. 58-73, IOS Press


Chiatti, A., Motta, E., Daga, E. and Bardaro, G. (2021) Fit to Measure: Reasoning about Sizes for Robust Object Recognition, Proceedings of the AAAI2021 Spring Symposium on Combining Machine Learning and Knowledge Engineering (AAAI-MAKE 2021), International Virtual Event


Daga, E., Meroño-Peñuela, A. and Motta, E. (2021) Sequential Linked Data: the State of Affairs, pp. (In press)


Mulholland, P., Daga, E., Daquino, M., Díaz-Kommonen, L., Gangemi, A., Kulfik, T., Wecker, A., Maguire, M., Peroni, S. and Pescarin, S. (2021) Enabling multiple voices in the museum: Challenges and approaches, Digital Culture & Society, pp. (In Press)


Daga, E., Asprino, L., Damiano, R., Agudo, B., Gangemi, A., Kuflik, T., Lieto, A., Marras, A., Pandiani, D., Mulholland, P., Peroni, S., Pescarin, S. and Wecker, A. (2021) Integrating citizen experiences in cultural heritage archives: requirements, state of the art, and challenges, pp. (In Press)


Bardaro, G., Antonini, A. and Motta, E. (2021) Robots for Elderly Care in the Home: A Landscape Analysis and Co-Design Toolkit, International Journal of Social Robotics, pp. (Early Access)


Antonini, A., Benatti, F., Watson, N., King, E. and Gibson, J. (2021) Death and Transmediations: Manuscripts in the Age of Hypertext, HT '21: Proceedings of the 32th ACM Conference on Hypertext and Social Media, Virtual Event USA


Benatti, F. and Antonini, A. (2021) Into the Macroscope: Systematic integration of micro- and macro-scale study of digital reading, The 17th IGEL Conference, University of Liverpool (Remote)


2020

Morales Tirado, A., Oelen, A., Pasqual, V., Shi, M., Umbrico, A., Xu, W. and Celino, I. (2020) A human-in-the-loop framework to handle implicit bias in crowdsourced KGs, ISWS International Semantic Web Research Summer School 2019, Bertinoro, Italy


Bruni, L., Daga, E., Damiano, R., Diaz, L., Kuflik, T., Lieto, A., Gangemi, A., Mulholland, P., Peroni, S., Pescarin, S. and Wecker, A. (2020) Towards Advanced Interfaces for Citizen Curation, Workshop on Advanced Visual Interfaces and Interactions in Cultural Heritage (AVI2CH 2020), Ischia, Italy


Morales Tirado, A., Daga, E. and Motta, E. (2020) Effective use of personal health records to support emergency services, EKAW 2020 - 22nd International Conference on Knowledge Engineering and Knowledge Management, Bolzano, Italy


Chiatti, A., Motta, E. and Daga, E. (2020) Towards a Framework for Visual Intelligence in Service Robotics:Epistemic Requirements and Gap Analysis, Proceedings of the 17th International Conference on Principles of Knowledge Representation and Reasoning (KR 2020)


Lupi, L. and Antonini, A. (2020) Readapting Propp's character archetypes to explore the relational dimension of city data: a design-oriented approach Readapting Propp's character archetypes to explore the relational dimension of city data: a design-oriented approach, pp. 200-229


Vignale, F., Antonini, A. and Gravier, G. (2020) The Reading Experience Ontology (REO): Reusing and Extending CIDOC CRM, Digital Humanities Conference 2020, Ottawa


Antonini, A., Benatti, F. and King, E. (2020) Restoration and Repurposing of DH legacy projects: the UK-RED case, Digital Humanities Conference 2020, Ottawa


Antonini, A., Benatti, F. and Blackburn-Daniels, S. (2020) On Links To Be: Exercises in Style #2, 31st ACM Conference on Hypertext and Social Media (HT'20), Online


Antonini, A. and Brooker, S. (2020) Mediation as Calibration: A Framework for Evaluating the Author/Reader Relation, Proceedings of the 31st ACM HyperText, Orlando, Florida, USA


Antonini, A. and Benatti, F. (2020) *ing the Written Word: Digital Humanities Methods for Book History, SHARP 2020: Power of the Written Word, Amsterdam


Motta, E., Daga, E., Opdahl, A. and Tessem, B. (2020) Analysis and Design of Computational News Angles Analysis and Design of Computational News Angles, 8, pp. 120613-120626


Antonini, A., Vignale, F. and Gravier, G. (2020) READ-IT deliverable D2 - Model of the State of Mind V1.7


Antonini, A., (2020) Understanding the phenomenology of reading through modelling Understanding the phenomenology of reading through modelling, pp. (Early Access)


Lupi, L., Antonini, A., De Liddo, A. and Motta, E. (2020) Actionable Open Data: Connecting City Data to Local Actions Actionable Open Data: Connecting City Data to Local Actions, 16, pp. (In Press)


2019

Daga, E. and Motta, E. (2019) Capturing themed evidence, a hybrid approach, Proceedings of the 18th International Conference on Knowledge Capture, Marina del Rey, California, United States


Antonini, A. and Lupi, L. (2019) Social AI for Engaging UbiComp, Halfway to the Future, Nottingham, UK


Daga, E. and Motta, E. (2019) Challenging knowledge extraction to support the curation of documentary evidence in the humanities, Third International Workshop on Capturing Scientific Knowledge (Sciknow). Collocated with the tenth International Conference on Knowledge Capture (K-CAP), Los Angeles, California, USA


Daga, E., Meroño-Peñuela, A. and Motta, E. (2019) Modelling and Querying Lists in RDF. A Pragmatic Study, QuWeDa 2019: 3rd Workshop on Querying and Benchmarking the Web of Data, Auckland, New Zealand


Meroño-Peñuela, A. and Daga, E. (2019) List.MID: A MIDI-Based Benchmark for Evaluating RDF Lists, International Semantic Web Conference (ISWC 2019), The University of Auckland, New Zealand


Vignale, F., Benatti, F. and Antonini, A. (2019) Reading in Europe - Challenge and Case Studies of READ-IT Project, DH2019, Utrecht, Netherland


Lupi, L. and Antonini, A. (2019) City Planning and Web-based technologies: misalignments, convergences, and potential future directions, 16th International Conference on Computers in Urban Planning and Urban Management, Wuhan, China


Lupi, L. and Antonini, A. (2019) City Planning and Urban Informatics: misalignments, convergences, and potential future direction for web-based technologies, 16th International Conference on Computers in Urban Planning and Urban Management, Wuhan, China


Antonini, A., Benatti, F., King, E., Vignale, F. and Gravier, G. (2019) Modelling Changes in Diaries, Correspondence and Authors' Libraries to support research on reading: the READ-IT approach, Workshop: Open Data and Ontologies for Cultural Heritage (ODOCH'19) at CAiSE '19, Rome, Italy


Morales Tirado, A., Daga, E. and Motta, E. (2019) Towards a privacy aware information system for emergency response, 16th International Conference on Information Systems for Crisis Response and Management, Valencia, Spain


Antonini, A. and Lupi, L. (2019) Developing a meta-language in multidisciplinary research projects: the case study of READ-IT, Workshop: W14: Standing on the Shoulders of Giants: Exploring the Intersection of Philosophy and HCI at CHI 2019, Glasgow, UK


Antonini, A., Mejia, G. and Lupi, L. (2019) All We Do is Stalking - Studying New Forms of Reading in Social Networks, HyperText 2019, Hof, Germany, ACM


Tiddi, I., Bastianelli, E., Daga, E., d'Aquin, M. and Motta, E. (2019) Robot-City Interaction: Mapping the Research Landscape - A Survey of the Interactions Between Robots and Modern Cities Robot-City Interaction: Mapping the Research Landscape - A Survey of the Interactions Between Robots and Modern Cities, pp. (Early Access)


Daga, E. and Gangemi, A. (2019) Linked Data for the Humanities: methods and techniques, Digital Humanities 2019: Complexities, Uthrect


Antonini, A., Vignale, F., Guillaume, G. and Brigitte, O. (2019) The Model of Reading: Modelling principles, Definitions, Schema, Alignments


2018

Antonini, A. and Lupi, L. (2018) From Service to Data Infrastructure - The Transition from MK Intelligence Observatory to MK:Insight


Meroño-Peñuela, A., Valk, R., Daga, E., Daquino, M. and Kent-Muller, A. (2018) The Semantic Web MIDI Tape: An Interface for Interlinking MIDI and Context Metadata, Workshop: Workshop on Semantic Applications for Audio and Music (SAAM) held in conjunction with ISWC 2018., Monterey, California, USA


Meroño-Peñuela, A., Kent-Muller, A., Valk, R., Daquino, M. and Daga, E. (2018) A Large-Scale Semantic Library of MIDI Linked Data, DLfM '18 - 5th International Conference on Digital Libraries for Musicology, Institut de Recherche et Coordination Acoustique/Musique (IRCAM), Paris, France


Daga, E., Gangemi, A. and Motta, E. (2018) Reasoning with Data Flows and Policy Propagation Rules, Semantic Web - Interoperability, Usability, Applicability, IOS Press


Daga, E. (2018) Knowledge Components and Methods for Policy Propagation in Data Flows Knowledge Components and Methods for Policy Propagation in Data Flows


2017

Daga, E., d'Aquin, M. and Motta, E. (2017) Propagating Data Policies: a User Study, Knowledge Capture (K-Cap 2017), Austin, Texas (US) Proceedings of the Knowledge Capture Conference


Daquino, M., Daga, E., d'Aquin, M., Gangemi, A., Holland, S., Laney, R., Penuela, A. and Mulholland, P. (2017) Characterizing the Landscape of Musical Data on the Web: State of the Art and Challenges, Workshop: Second Workshop on Humanities in the Semantic Web - WHiSe II at Co-located with the 16th International Semantic Web Conference (ISWC), Vienna, Austria


2016

Daga, E., d'Aquin, M., Motta, E. and Gangemi, A. (2016) An Incremental Learning Method to Support the Annotation of Workflows with Data-to-Data Relations, Knowledge Engineering and Knowledge Management, pp. 129-144, Springer


Tiddi, I., Bastianelli, E., Daga, E. and d'Aquin, M. (2016) DKA-robo: dynamically updating time-invalid knowledge bases using robots, Demo at 20th International Conference on Knowledge Engineering and Knowledge Management (EKAW2016), Bologna, Italy


Tiddi, I., Daga, E., Bastianelli, E. and d'Aquin, M. (2016) Update of time-invalid information in Knowledge Bases through Mobile Agents, Workshop: Integrating Multiple Knowledge Representation and Reasoning Techniques in Robotics (MIRROR-16) at 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2016), Deajeon, South Korea


Daga, E., d'Aquin, M. and Adamou, A. (2016) Addressing exploitability of Smart City data, IEEE International Smart Cities Conference (ISC2)


Adamou, A., Daga, E. and Isaksen, L. (eds.) (2016) WHiSe 2016 - Humanities in the Semantic Web, Workshop: 1st Workshop on Humanities in the Semantic Web (WHiSe 2016) at 13th ESWC Conference 2016, Anissaras, Greece, 1608, CEUR-WS.org


2015

Daga, E., d'Aquin, M., Gangemi, A. and Motta, E. (2015) Bottom-Up Ontology Construction with Contento, Demo at International Semantic Web Conference


Daga, E., Panziera, L. and Pedrinaci, C. (2015) BASIL: A Cloud Platform for Sharing and Reusing SPARQL Queries as Web APIs, Demo at International Semantic Web Conference


Daga, E., Panziera, L. and Pedrinaci, C. (2015) A BASILar Approach for Building Web APIs on top of SPARQL Endpoints, Workshop: Services and Applications over Linked APIs and Data (SALAD) at European Semantic Web Conference (ESWC), Portoroz, Slovenia


Daga, E., d'Aquin, M., Motta, E. and Gangemi, A. (2015) A Bottom-Up Approach for Licences Classification and Selection, Workshop: International Workshop on Legal Domain And Semantic Web Applications (LeDA-SWAn 2015) at 12th Extended Semantic Web Conference (ESWC 2015), Portoroz, Slovenia


Daga, E., d'Aquin, M., Adamou, A. and Brown, S. (2015) The Open University Linked Data - data.open.ac.uk, Semantic Web - Interoperability, Usability, Applicability, IOS Press


Daga, E., d'Aquin, M., Gangemi, A. and Motta, E. (2015) Propagation of Policies in Rich Data Flows, 8th International Conference on Knowledge Capture (K-CAP 2015)


2014

d'Aquin, M., Adamou, A., Daga, E., Liu, S., Thomas, K. and Motta, E. (2014) Dealing with Diversity in a Smart-City Datahub, Workshop: 5th Workshop on Semantics for Smarter Cities


Daga, E., d'Aquin, M., Gangemi, A. and Motta, E. (2014) Early analysis and debugging of linked open data cubes, Workshop: Workshop on Semantic Statistics at International Semantic Web Conference, Italy, CEUR-WS.org


2012

Presutti, V., Blomqvist, E., Daga, E. and Gangemi, A. (2012) Pattern-Based Ontology Design, Ontology Engineering in a Networked World


Daga, E. (2012) Towards a Theoretical Foundation for the Harmonization of Linked Data, Workshop: Doctoral Consortium at International Semantic Web Conference - ISWC 2012


2011

Svab-Zamazal, O., Daga, E., Dudas, M. and Svatek, V. (2011) Tools for Pattern-Based Transformation of OWL Ontologies, Demo at International Semantic Web Conference


2010

Presutti, V., Daga, E., Gangemi, A. and Blomqvist, E. (2010) eXtreme Design with content ontology design patterns, Workshop: Workshop on Ontology Patterns (WOP) at International Semantic Web Conference 2010


Blomqvist, E., Presutti, V., Daga, E. and Gangemi, A. (2010) Experimenting with eXtreme design, Workshop: Knowledge Engineering and Management by the Masses at 17th International Conference on Knowledge Engineering and Knowledge Management


Baldassarre, C., Daga, E., Gangemi, A., Gliozzo, A., Salvati, A. and Troiani, G. (2010) Semantic scout: making sense of organizational knowledge, 17th international conference on Knowledge engineering and management by the masses


Daga, E., Blomqvist, E., Gangemi, A., Montiel, E., Nikitina, N., Presutti, V. and Terrazas, B. (2010) Pattern based ontology design: methodology and software support Deliverable 2.5.2


2008

Daga, E., Presutti, V. and Salvati, A. (2008) Http://ontologydesignpatterns.org [ODP] and Evaluation WikiFlow, 5th Workshop on Semantic Web Applications and Perspectives, SWAP 2008


Daga, E., Presutti, V., Gangemi, A. and Salvati, A. (2008) http://ontologydesignpatterns.org [ODP], Poster at International Semantic Web Conference


2007

Gliozzo, A., Gliozzo, A., Gangemi, A., Presutti, V., Cardillo, E., Daga, E., Salvati, A. and Troiani, G. (2007) A collaborative semantic web layer to enhance legacy systems, International Semantic Web Conference