How the project was organised
Our work plan was designed around a workflow containing three components:
- Identify data sources
- Define standards and develop processes to FAIRify the data
- Implement an infrastructure to host the FAIRified data
Overview
Six Work Packages (WPs) are responsible for implementing the tasks in the workflow above. Two groups called 'Squads' are responsible for making sure these tasks deliver positive outcomes for real-world data. Squads draw expertise from each of the Work Packages, and work to develop and improve FAIRification techniques and tools.
Members of the Squads also organise Bring Your Own Data (BYOD) sessions for representatives from industry (EFPIA), where industry partners bring their own data and learn how to FAIRify it.
Once a quarter, there is a Squad 'Face to Face' meeting. These meetings provide a focal point for iterating on FAIRification processes, through the exchange of experience and knowledge between Squads. After each Squad Face to Face, the latest progress is reported back to the rest of the FAIRplus consortium.
The Work Packages
WP1: Identification of data sources for FAIRification
Tasks
- To select datasets from IMI projects and industry to make FAIR, including three pilot IMI datasets.
- To evaluate the relevance, ELSI (Ethical, Legal and Social Implications) requirements, and scientific value of these datasets for FAIRification.
- To propose and implement selection criteria for dataset identification. These criteria will include societal impact and the specific requirements for metadata relevant to the EFPIA datasets, which will be used as a use case for WP2.
Deliverables
- First three pilot datasets selected and available. (June 2019)
- Selection criteria and guidelines for data sources from IMI projects and EFPIA internal databases. A report describing the criteria, including the underlying rationale with respect to ELSI, scientific impact, societal impact, and ease of access. (December 2019)
- The first 15 IMI datasets selected and available for inclusion in WP2-4 processes. (December 2020)
- Finalised selection criteria and guidelines for data sources, taking into account the practical operational experiences of WP2 and WP3. This includes a description of all 20 IMI and EFPIA data sources selected. (December 2021)
WP leaders
Participants
AstraZeneca, Bayer, Boehringer, BSC, Fraunhofer, GSK, IMIM (ELIXIR-ES), Janssen, Novartis, SIB (ELIXIR-CH), University of Luxembourg (ELIXIR-LU), UPS.
WP2: Standards definition and process development
Tasks
- To define community standards to describe, identify and interlink key elements of the datasets.
- To identify metrics to measure the level of FAIRness of the datasets, pre- and post-FAIRification.
- To define the FAIRification process by implementing the standards and the metrics through Bring Your Own Data (BYOD) workshops.
- To provide methods for estimating the return on investment for FAIRified datasets.
Deliverables
- FAIR Cookbook. This will be in the form of recipes containing guidance on how to use a stack of standards, metrics, and related services to FAIRify datasets. (December 2021)
- BYOD Guidelines. This document will describe best practice for organising Bring Your Own Data events (BYODs), as well as explain how to FAIRify specific data types, for different maturation levels. (December 2021)
- Report on BYODs. This is a set of documents reporting on the work performed at each BYOD event, including inputs/outputs, exemplars and lessons learned. (December 2021)
WP leaders
Deputy WP lead: Philippe Rocca-Serra (University of Oxford, ELIXIR UK; philippe.rocca-serra@oerc.ox.ac.uk)
Participants
AstraZeneca, BSC (ELIXIR-ES), EMBL-EBI, Fraunhofer, Heriot Watt University (ELIXIR-UK), IMIM (ELIXIR-ES), Imperial College London, Janssen, Maastricht University (ELIXIR-NL), Novartis, University of Luxembourg (ELIXIR-LU), University of Manchester (ELIXIR-UK), University of Oxford (ELIXIR-UK), UPS.
WP3: Implementation and infrastructure
Tasks
- To determine and to test criteria for hosting solutions for IMI FAIR datasets.
- To deliver and execute an iterative deployment plan using the identified hosting platforms, standards (WP2) and tools (WP3).
- To apply and extend the FAIR tools stack.
- To validate the progress of the FAIRification of datasets by using metrics for database interoperability.
- To create a FAIRification sustainability plan with recommendations for future projects.
Deliverables
- First phase exemplar IMI projects FAIRified. A list of FAIRified datasets from exemplar IMI projects, indicating the level of FAIRness. (December 2019)
- IMI FAIR metrics publication. (December 2020)
- A report on IMI projects for data types and current technical solutions detailing phasing of implementation. (December 2020)
- Confidential report summarising the work on the FAIRification of the first EFPIA exemplar. (June 2021)
- An IMI FAIR Data Catalogue for IMI data that supports data discovery and where IMI FAIRification progress and metrics can be publicly indicated. (December 2021)
- A technical feasibility report with an exemplar for each data type, outlining local, remote and cloud-based options for FAIRification hosting. (December 2021)
- A FAIRification guidance tool for IMI data that will allow users to select both community and project delivered tools and processes for specific FAIRification use cases. (December 2021)
- Sustainability White Paper. (December 2021)
WP leaders
Participants
AstraZeneca, Bayer, Boehringer, BSC (ELIXIR-ES), EMBL-EBI, GSK, IMIM (ELIXIR-ES), Imperial College London, Janssen, PHACTS, The Hyve, University of Oxford (ELIXIR-UK), University of Manchester (ELIXIR-UK), Maastricht University (ELIXIR-NL), University of Luxembourg (ELIXIR-LU).
WP4: Communication and outreach
Tasks
- To educate the next generation of experts in how to FAIRify datasets within IMI projects, EFPIA partners and beyond.
- To reach out to the large community of stakeholders (SMEs, policy makers, and scientists) through various communication channels.
- To demonstrate the value of the FAIRplus FAIRification process with convincing science cases (in collaboration with EFPIA).
- To provide FAIRplus dissemination packages with guidelines for the FAIRification process.
Deliverables
- Description of the objective evaluation criteria for the Fellowship Programme application review. The criteria will include a description of how equal treatment of all applications will be ensured for those receiving public funding. (September 2019)
- First FAIR Innovation and SME event. (December 2019)
- FAIRplus Fellowship Programme. This includes a link to the documentation of the completed Fellowship Programme, including its structure, training modules, incentives, and legal framework. (March 2020)
- Report on Policy-maker engagement strategy. (March 2020)
- A use case dissemination package. This will include three convincing use cases demonstrating the scientific value of FAIRification, and appealing to both researchers and research support staff. (December 2021)
WP leaders
Participants
AstraZeneca, Bayer, Boehringer, BSC (ELIXIR-ES), ELIXIR Hub, Fraunhofer, GSK, IMIM (ELIXIR-ES), Janssen, Lygature, The Hyve, University of Oxford (ELIXIR-UK), UPS.
WP5: Project management, coordination, dissemination and sustainability
Tasks
- To establish a governance structure with appropriate participation from public and EFPIA members, taking diversity into account.
- To periodically monitor the project, including work plan execution, quality assurance, data management, finance execution, risks, communication and innovation management activities.
- To define, implement and monitor the project dissemination and communication strategy.
- To develop and update a data management plan along the project lifecycle.
- To evaluate, develop and implement a sustainability plan for each relevant project result (in collaboration with WP2, WP3 and WP4).
Deliverables
- Website, project governance and communication assets and guidelines available. (April 2019)
- Data Management Plan (DMP). Link to the confidential Data Management Plan, which will include details about original data sources, tools and high-level derived data (metadata) generated as a result of the FAIRification process, and how FAIR principles are guaranteed. (June 2019)
- FAIRplus Handbook and project monitoring. Link to the initial version of the project handbook including guidelines (i.e. governance, communication) and assets as well as project performance indicators. (December 2019)
- KPI Dashboard. Key Performance Indicators of FAIRplus activities. (July 2020)
- Sustainability Plan. Consolidated sustainability plan that will incorporate inputs of all participants and specific work carried out by other WPs. (December 2020)
- Updated version of the Data Management Plan made available to the FAIRplus participants. (December 2020)
- Updated FAIRplus Handbook and project monitoring, with a focus on lessons learnt. (December 2021)
- FAIRplus overview of the scientific publications. Collective view of the scientific publications generated during the project time frame. (June 2022)
- Data Management Plan updated. Updated version of the DMP made available to the FAIRplus participants. (June 2022)
WP leaders
Participants
AstraZeneca, Bayer, Boehringer, BSC (ELIXIR-ES), ELIXIR Hub, Fraunhofer, GSK, IMIM (ELIXIR-ES), Janssen, Lygature, The Hyve, University of Oxford (ELIXIR-UK), University of Luxembourg (ELIXIR-LU), UPS.
WP6: Ethics Requirements
Tasks
- To establish an Ethics Board to monitor the ethics and data protection issues raised by the project.
- To evaluate the ethics/legal risks related to the data processing activities of the project.
- To ensure that there is a lawful basis for processing the data used in the project and that the appropriate technical and organisational measures are in place to safeguard the rights of the data subjects.
Deliverables
- General ethics requirement. Ethics Board: an Ethics Board established, comprising one jurist, especially competent in data protection issues, one philosopher and one social scientist. This is to ensure that all facets of the ethical aspects of the project can be addressed. (31 March 2019)
- A procedure to identify ELSI issues and make sure data is processed lawfully, with appropriate safeguards for personal data in the selected datasets. (30 June 2019)
- A set of measures that the various persons involved in data stewardship, data governance and data use need to document and respect when an IMI project agrees to give access to its data. (31 March 2019)
WP leaders
The Squads
The term 'Squad' comes from the agile methodology in software engineering, where squads are autonomous units cutting across the hierarchical structure. Squads in FAIRplus are composed of key Work Package experts (both public and EFPIA). They bring the necessary expertise and technical skills required for individual dataset FAIRification.
Both Squads work independently to FAIRify datasets from the four pilot projects, whilst sharing their experience and establishing generalised practices that should be applicable to subsequent FAIRification efforts.
Squads refine their approaches over three-month release cycles, making their datasets incrementally more FAIR with each release. The Squad members may change after each release based on the expertise needed to swarm the new problem or use case.
The Squads operate through weekly calls and they meet face-to-face once in every cycle to discuss their approaches, review work done during that release cycle, and plan for the next release. The first two meetings took place in April 2019 in Hinxton and in July 2019 in London, UK.