grant

UniProt: A Protein Sequence and Function Resource for Biomedical Science

Organization EUROPEAN MOLECULAR BIOLOGY LABORATORYLocation HEIDELBERG, GERMANYPosted 18 Sept 2014Deadline 31 May 2026

NIHUS FederalResearch GrantFY2025AI systemAccelerationAffectAmino Acid SequenceArtificial IntelligenceBiomedical ResearchBody TissuesCatalogsCell BodyCellsCollaborationsCommunitiesComplexComputer ReasoningCuesDNA Molecular BiologyDataData SetDevelopmentDiathesisDiseaseDisease susceptibilityDisorderDistance LearningDrugsEnsureEnvironmentFAIR dataFAIR guiding principlesFAIR principlesFTPFindable, Accessible, Interoperable and Re-usableFindable, Accessible, Interoperable, and ReusableGeneralized GrowthGenomicsGenotypeGleanGrowthHealthHereditary DiseaseHumanHuman GeneticsHuman MicrobiomeInborn Genetic DiseasesIndividualInherited disorderInternationalInternetInvestigatorsKnowledgeKnowledge ExtractionLiteratureMachine IntelligenceMachine LearningMacromolecular StructureMedicationMethodsModern ManModernizationMolecularMolecular BiologyMolecular Sequence DataMolecular StructureOntologyOrganOrthologOrthologous GeneOutcomePaperPathway interactionsPatternPharmaceutical PreparationsPhenotypePlayPreparednessPrimary Protein StructureProcessProductionProtein ArrayProteinsPublicationsReadabilityReadinessResearchResearch PersonnelResearch ResourcesResearchersResourcesRoleScienceScientific PublicationShapesSiteStandardizationStructureSystemTechnologyTissue GrowthTissuesTrainingTriageVariantVariationWWWWorkbiomedical data sciencebiomedical resourcecatalogcomputable knowledgeconferenceconventioncrowd sourcecrowd-sourcingcrowdsourcecrowdsourcingdata accessdata re-usedata reusedata to traindataset to traindeep learningdeep learning based modeldeep learning methoddeep learning modeldeep learning strategydesigndesigningdevelopmentaldiscovery miningdrug/agentexperienceexperimentexperimental researchexperimental studyexperimentsformycin 5'-triphosphateformycin A 5'-triphosphateformycin triphosphategenetic architecturegenomic variationhack dayhackathonhackfesthereditary disorderheritable disorderhuman diseasehuman-associated microbiomeimprovedinborn errorinherited diseasesinherited genetic diseaseinherited genetic disorderinnovateinnovationinnovativeknowledge curationknowledgebaselearning engagementliability to diseaseliterature miningliterature searchingmachine based learningmachine learning based methodmachine learning methodmachine learning methodologiesmacromolecular assemblymeetingmeetingsnew technologynovel technologiesontogenypathogenpathwaypersonalized diagnosispersonalized diagnosticsprecise diagnosticsprecision diagnosticsprognosticprotein functionprotein sequenceremote learningresponsesocial mediasocial rolesummitsymposiasymposiumtext miningtext searchingtraining datawebweb sitewebinarwebsiteworld wide web

— or —

Get email alerts for similar roles

Full Description

PROJECT SUMMARY/ABSTRACT
This project continues the development of the UniProt Knowledgebase, which aims to provide the scientific
community with a comprehensive, high-quality, and freely accessible resource of protein sequences and
functional information. Proteins are an essential bridge between human genetics, the environment and
phenotype. While human genetics has increasing power to find correlations between genotype and phenotype,
knowledge of how proteins function, provided by UniProt, is essential for the mechanistic understanding critical
to develop health outcomes through improved and personalized diagnostics, prognostics, and treatments.
Biomedical research is being revolutionized by methods from the field of Artificial Intelligence, particularly
Machine Learning (ML) approaches such as Deep Learning (DL). These approaches now outstrip the ability of
humans in many fields and are state-of-the-art when sufficient data is available. UniProt provides gold standard
training data for hundreds of ML applications in biomedical research. The work in this proposal will enhance the
readiness of UniProt for use in ML and will integrate ML methods to enhance our efficiency.
UniProt curators extract and synthesize experimental knowledge of proteins from papers in human and machine-
readable forms using a range of standard ontologies. This proposal will further structure protein knowledge in
UniProt, developing complete, machine-readable catalogs of the functional impact of human variation and of
human protein networks and complexes, essential to understanding human disease. Efficiency of curation will
be improved using DL models, developed in collaboration with text mining experts, to automate the identification
of relevant papers and accelerate extraction of knowledge. This extracted knowledge will be validated by our
expert curators and also the wider research community who will be actively engaged to further scale curation.
ML approaches will also be used to infer annotations for proteins with no experimental characterization, using
community challenges to develop faster, more accurate, scalable approaches to annotate the deluge of
uncharacterized proteins.
UniProt is an exemplar FAIR resource and has served the scientific community with metronomic data releases
despite an exponential growth in data volumes. Streamlined production processes will scale efficiently and
sustainably with both the growing data volume and complexity. We will explore novel technologies to ensure the
continued timely release of data to the community according to the FAIR principles.
UniProt is an international hub of protein data that serves hundreds of thousands of users annually. We will
continue using user-centric approaches to develop the UniProt website in response to user needs and new data
types. We will engage with our stakeholders and collaborators by introducing an annual strategic partnership
meeting. We will engage our communities through webinars, social media, hackathons and attendance at
scientific meetings to broaden the efficient and impactful use of our data.

Grant Number: 5U24HG007822-12
NIH Institute/Center: NIH
Principal Investigator: Alex Bateman

Agency Plan

7-day free trial

Unlock procurement & grants

Upgrade to access active tenders from World Bank, UNDP, ADB and more — with email alerts and pipeline tracking.

$29.99 / month

🔔Email alerts for new matching tenders
🗂️Track tenders in your pipeline
💰Filter by contract value
📥Export results to CSV
📌Save searches with one click

Start 7-day free trial →

Explore more

📍 More grants in GERMANY 🏷 More NIH opportunities 🏷 More US Federal opportunities 🏷 More Research Grant opportunities 🏢 All nih_reporter opportunities

UniProt: A Protein Sequence and Function Resource for Biomedical Science

Full Description

Unlock procurement & grants

Explore more

More from EUROPEAN MOLECULAR BIOLOGY LABORATORY

More from EUROPEAN MOLECULAR BIOLOGY LABORATORY