Sarah Shoilee

PhD Candidate, Vrije Universiteit Amsterdam
Department of Computer Science
Research Group: User-Centric Data Science
[s.b.a.shoilee@vu.nl] [cv]

Project: Pressing Matter – financed by the Dutch National Science Agenda (NWA)
PhD Supervisors: Dr. Victor de Boer; Prof. Dr. Jacco van Ossenbruggen & Prof. Dr. Susan Legene

My research focuses on fostering knowledge discovery in complex domains, particularly within provenance research for cultural heritage. I use data science approaches that combine Semantic Web technologies, Linked Data, and interdisciplinary methods to model, extract, and visualize contextual information. This work aims to improve data quality and interpretability, especially for colonial heritage collections, while supporting meaningful human-computer collaboration.

Building on my current work, my future research will continue to explore hybrid processes in data science, particularly within socio-technical systems where human and computational elements interact in complex ways. I am deeply interested in advancing interdisciplinary methods that bridge the gap between technical innovation and domain-specific expertise.

A central focus of my future work will be on Knowledge Discovery and Capture, encompassing the extraction, mining, and communication of structured and unstructured information across diverse data sources. I aim to contribute to the development of frameworks and tools using Semantic Web technologies and Linked Data that help make knowledge more findable, interoperable, actionable, and reusable—both for machines and for humans.

Current Research Areas

  • Semantic Web and Linked Data.
  • Digital Humanities.
  • Knowledge Discovery.
  • Hybrid and Interdisciplinary Data Science.

Most Relevant Papers

A selection of papers that represent my research interests and style.

Enhancing Provenance Research with Linked Data: A Visual Approach to Knowledge Discovery
Sarah Binta Alam Shoilee, Annastiina Ahola, Heikki Rantala, Eero Hyvönen, Victor de Boer, Jacco van Ossenbruggen, Susan Legene.
SemDH 2025 Workshop, co-located with ESWC 2025

A Framework for Evaluating Entity Alignment Impact on Downstream Knowledge Discovery
Sarah Binta Alam Shoilee, Victor de Boer, Jacco van Ossenbruggen.
EKAW 2024

Polyvocal Knowledge Modelling for Ethnographic Heritage Object Provenance.
Sarah Binta Alam Shoilee, Victor de Boer, Jacco van Ossenbruggen.
SEMANTICS 2023