Work package 2. Development of the DPOs Translator and Recommender
🎯 Objectivs
WP2 developes an AI recomendation system that take advantage of the WP1 DPOs data lake to suppoert cross-domain omics research. It translate the search quieries into domain specific isntructions across OMICs fields. Allowing researchers to optain the most relevent DPOs workflow for a single domain or compine different domains to create a containarised DPO. Benchmarking and expert validation ensure recommendations are accurate, relevant, and reproducible.
🚧 Tasks
Task 2.1 – Cross-domain DPO Translator
2.1.a – Design a linguistic framework to annotate data objects and tools across different omics domains (genomics, transcriptomics, proteomics, metabolomics). 2.1.b – Train a natural language processing (NLP) model to translate scientific questions into domain-specific instructions for DPO retrieval and workflow execution. 2.1.c – Integrate the translator with the DPO data lake to enable accurate cross-domain mapping without hallucination.
Task 2.2 – DPO Recommender
2.2.a – Develop an AI-based recommender system to suggest relevant DPOs for a given scientific query. 2.2.b – Implement mechanisms to combine or adapt DPOs from multiple domains, returning containerised workflows ready for execution. 2.2.c – Ensure the recommender operates reliably in environments with limited GPU resources, using agent-style support models grounded in the curated DPO data lake.
Task 2.3 – DPO Recommender Benchmarking
2.3.a – Conduct cross-domain pilot studies to evaluate the relevance and accuracy of DPO recommendations. 2.3.b – Collect expert assessments to verify that the recommended workflows meet domain-specific scientific needs. 2.3.c – Analyse performance metrics, document benchmarking results, and identify areas for optimisation and improvement.
🚚 Deliverables:
- D2.1 - Data Processing Obect Translator
- D2.2 - Data processing object Recomender
- D2.3 - Cross-domain benschmark/application of the Data Proecssing Object Recommeder