grant

Flexible NLP toolkit for automatic curation of outcomes for breast cancer patients

Organization MAYO CLINIC ARIZONALocation SCOTTSDALE, UNITED STATESPosted 1 Aug 2022Deadline 31 Jul 2027
NIHUS FederalResearch GrantFY2025AI algorithmActive Follow-upAdherenceAdoptedAgeAnxietyBiologicalBlack PopulationsBlack groupBlack individualBlack peopleBlacksBreastBreast CancerBreast Cancer PatientBreast Cancer Risk FactorBreast Cancer TreatmentBreast Tumor PatientCaliforniaCancer BurdenCancer PatientCancer StagingCancer TreatmentCancersCessation of lifeClinicClinicalCollaborationsCommunicationComputer Software ToolsComputerized Medical RecordDataData BasesData CollectionDatabasesDeathDetectionDevelopmentDiagnosisDiagnostic Neoplasm StagingDiseaseDisorderDisparitiesDisparityDistant CancerDistant MetastasisEarly DiagnosisElectronic Medical RecordEndocrine TherapyEpidemiologistEthnic OriginEthnicityEvaluationFatigueFundingFutureGeneral RadiologyGuidelinesHealth CareHealth systemHispanic FemalesHispanic PopulationsHispanic WomenHispanic groupHispanic individualHispanic peopleHispanicsHistoryHormonal TherapyHospitalsHourHumanImmune mediated therapyImmunologically Directed TherapyImmunotherapyInformaticsInstitutionInsurance CoverageInsurance StatusInterventionInvestigatorsLack of EnergyLeadLearningMalignant Breast NeoplasmMalignant Neoplasm TherapyMalignant Neoplasm TreatmentMalignant NeoplasmsMalignant TumorMalignant neoplasm of prostateMalignant prostatic tumorManualsMedical centerMental DepressionMetastasisMetastasizeMetastatic LesionMetastatic MassMetastatic NeoplasmMetastatic TumorMethodologyMethodsMissionModelingModern ManMorbidityMorbidity - disease rateNational Cancer BurdenNatural Language ProcessingNauseaNeoplasm MetastasisNeoplasm StagingNon-HispanicNonhispanicNot Hispanic or LatinoOncologistOperative ProceduresOperative Surgical ProceduresOutcomePaperPathologyPathology ReportPatient outcomePatient-Centered OutcomesPatient-Focused OutcomesPatientsPatternPb elementPerformancePopulationPopulation HeterogeneityPrimary NeoplasmPrimary TumorProcessPrognosisProstate CAProstate CancerProstate malignancyPsyche structureQOLQuality of lifeRaceRacesRadiationRadiologyRadiology SpecialtyRecording of previous eventsRecurrenceRecurrentRecurrent Malignant NeoplasmRecurrent Malignant TumorRegistriesReportingResearch PersonnelResearchersRoleRunningSES disparitySecondary NeoplasmSecondary TumorSiteSoftware ToolsStage at DiagnosisStructureSurgicalSurgical InterventionsSurgical ProcedureT-StageTeam NursingTechnologyTestingTextTimeTumor StagingTumor SubtypeTumor stageUniversity HospitalsValidationVisitactive followupagesanti-cancer researchanti-cancer therapyartificial intelligence algorithmbiologicbiological systemsbreast cancer riskbreast cancer survivalcancer classificationcancer locationcancer metastasiscancer preventioncancer recurrencecancer registrycancer researchcancer sitecancer survivalcancer therapycancer-directed therapychemotherapyclinical centerclinical encounterco-morbidco-morbiditycomorbiditycomputer scientistcostcurating datadata basedata curationdata integrationdepressiondevelopmentaldiverse populationsearly detectionexperienceexpression subtypesflexibilityflexiblefollow upfollow-upfollowed upfollowupheavy metal Pbheavy metal leadheterogeneous populationhistorieshormone therapyimmune therapeutic approachimmune therapeutic interventionsimmune therapeutic regimensimmune therapeutic strategyimmune therapyimmune-based therapiesimmune-based treatmentsimmuno therapyindividuals with breast cancerinformatics toollow income countrymalignancymalignant breast tumormentalmolecular sub-typesmolecular subsetsmolecular subtypesmulti-modal datamulti-modal datasetsmultidisciplinarymultimodal datamultimodal datasetsnatural language understandingneoplasm registryneoplasm/canceropen sourceoutcome predictionpatient oriented outcomespatient populationpatients with breast cancerperson with breast cancerphysical conditioningphysical healthpopulation basedpopulation diversityracialracial backgroundracial originradiologistrelational databasesocial rolesocio-economic disparitysocio-economic inequalitysocio-economic inequitysocioeconomic disparitysocioeconomic inequalitysocioeconomic inequitysoftware toolkitsurgerysurveillance datasurvival disparitytooltreatment and outcometreatment planningtrendtumor cell metastasisvalidations
Sign up free to applyApply link · pipeline · email alerts
— or —

Get email alerts for similar roles

Weekly digest · no password needed · unsubscribe any time

Full Description

Project summary/Abstract
Breast cancer has the largest number of new cases in world (11.7%). Although the prognosis of

breast cancer patients is generally favorable due to early detection and comprehensive treatment,

20%–30% of patients will still develop distant metastases and cases with progressive stage only

have a median two-year survival time. Breast cancer is widely recognized as a heterogeneous

disease in the sense of both primary tumor metastatic capacity and time to metastatic spread of

disease. High-quality population-based cancer surveillance data are needed to: (1) describe

cancer burden, patterns, and outcomes in order to (2) inform cancer prevention, detection and

control activities; and (3) evaluate interventions on the basis of past and future trends so that

optimal approaches to alleviate burden and suffering from cancer can be adopted. However, the

laborious manual curation process makes the population wise surveillance data collection

challenging. It has been shown in studies that a large percentage of total registry cost is devoted

to labor for data curation, even in the low-income countries. In this project, our mission is to build

a flexible NLP toolset that can be executed locally at the institution level and will curate the clinical

and patient-centered outcomes of breast cancer patients by parsing longitudinally acquired clinic

notes, radiology and pathology reports. In order to test the generalizability of the tools and to

initiate their deployment for data collection, we will partner with both Georgia SEER and California

state cancer registry and will curate the outcome data of past 10-years breast cancer patients

from two institutions across US representing diverse patient populations - Emory University

hospital (Georgia) and Stanford Medical Center (California). We will leverage the previously

developed tools and technologies and extend them to automatically curate the clinical and patient-

centered outcome data – recurrence date and site of recurrence, treatment administered, mental

and physical outcomes – from clinic notes and convert these into structured and query-able

format. The NLP tools will be dockerized and run locally at the hospital registry level for automated

outcome curation. Finally, the NLP extracted outcomes will be shared with State Cancer registry

for evaluation. From a methodological perspective, the framework and the open-source software

tools developed can be employed for cancer research beyond the scope of our project for curating

outcomes regardless of the problem domain.

Grant Number: 5U01CA269264-04
NIH Institute/Center: NIH

Principal Investigator: Imon Banerjee

Sign up free to get the apply link, save to pipeline, and set email alerts.

Sign up free →

Agency Plan

7-day free trial

Unlock procurement & grants

Upgrade to access active tenders from World Bank, UNDP, ADB and more — with email alerts and pipeline tracking.

$29.99 / month

  • 🔔Email alerts for new matching tenders
  • 🗂️Track tenders in your pipeline
  • 💰Filter by contract value
  • 📥Export results to CSV
  • 📌Save searches with one click
Start 7-day free trial →