SlideShare a Scribd company logo
1 of 34
Download to read offline
Mapping biomedical literature into
UMLS concepts
MetaMap
Presented By: Osama Jomaa
Miami University
Unified Medical Language
Motivation
“... to facilitate the development of computer
systems that behave as if they "understand"
the meaning of the language of biomedicine
and health.”
National Library of Medicine
UMLS Components
1.Metathesaurus
+1 Million biomedical concepts from over 100 vocabularies
2..Semantic Network
133 categories & 54 relationships.
3..Specialist Lexicon & Lexical Tools
Software programs to aid in NLP
Meta thesaurus
Patient Care Controlled Terms
Biomedical Vocabs from
Different
Languages
Clinical/Health Services Research
Health Services Billing
Biomedical Literature Catalogs
Public Health Statistics
.
.
.
.
.
5,000,000biomedicalterm
1,000,000Concepts
+ 100 Source Vocabs
Relational DB Tables
Metathesaurus
●Concepts are classified into categories:
–Diagnosis
–Procedures & Supplies
–Diseases
–….
●Concepts have unique identifier.
●Concepts have preferred terms.
●Concepts can be grouped into subsets via applying
filters.
Source Vocabularies Categories
One Concept Many Terms
One concept can have many terms in multiple
vocabularies.
Example: Atrial Fibrillation
Preferred Terms
Concept: Hodgkin's Disease
Unique Identifiers
● Concept Unique Identifier (CUI)
Link all the names in all the source vocabs that mean the same
to one concept and assign a unique identifier, CUI, to it.
● Lexical Unique Identifier (LUI)
Are lexical variants for the concepts detected using Lexical
Variant Generator (LVG) program.
● String Unique Identifier (SUI)
Represents variations in the char set, upper-lower case, or
permutation difference.
● Atom Unique Identifier (AUI)
Every occurrence of a string in each source vocab is assigned a
unique identifier, AUI.
Semantic Network
● Semantic Types
+133 types, each MT concept assigned one semantic type at
least.
● Semantic Relationships
54 relationaship. Is-A is the most important.
Semantic Network
Semantic Types Examples:
✔ Organisms
✔ Anatomical structures
✔ Biologic function
✔ Chemicals
✔ Physical objects
Entity
Event
Semantic Relationships Examples:
✔ Physically related to
✔ Spatially related to
✔ Temporally related to
✔ Functionally related to
✔ Conceptually related to
Lexical Tools
●The Specialist Lexicon
Is an English lexicon (dictionary) that includes over 200,000
biomedical terms from a variety of source to aid in NLP.
●Lexical Variant Generator (LVG)
●Norm
Normalizer
●Wordind
Tokenizer
MetaMap
Why Concept Identification?
● Information extraction/Data mining
● Classification/Categorization
● Text summarization
● Question answering
● Literature-based Knowledge Discovery
Example
Phrase: “lung cancer.”
Meta Candidates (8):
1000 Lung Cancer {MDR,DXP} (Malignant neoplasm of lung) [Neoplastic Process]
1000 Lung Cancer (Carcinoma of lung) [Neoplastic Process]
861 Cancer (Malignant Neoplasms) [Neoplastic Process]
861 Lung [Body Part, Organ, or Organ Component]
861 Cancer (Cancer Genus) [Invertebrate]
861 Lung (Entire lung) [Body Part, Organ, or Organ Component]
861 Cancer (Specialty Type - cancer) [Biomedical Occupation or
Discipline]
768 Pneumonia [Disease or Syndrome]
Meta Mapping (1000):
1000 Lung Cancer (Carcinoma of lung) [Neoplastic Process]
Meta Mapping (1000):
1000 Lung Cancer (Malignant neoplasm of lung) [Neoplastic Process]
The Algorithm
MetaMap Options
● Word Sense Disambiguation (-y)
Determines which concept is the best
choice using surrounding context.
● Negation (--negx)
Identifies negated entities.
Examples
●
WSD Examples
–“Fifteen (6.4%) of 234 colds treated with placebo ..”
●
Cold (cold temperature) [npop]
●
Cold (Common cold) [dsyn]
●
Cold (Cold Sensation) [phsf]
–“.. the drugs were compared in two four-point, double-blind
bioassays.”
●
Double (Diplopia) [dsyn] vs. Double (Duplicate) [ftcn]
●
Blind (Blind Vision) [dsyn] vs. BLIND (Blinded) [reasa] vs. Blind (Visually
impaired persons) [podg]
●
Bioassays (Biological Assay) [lbpr]
Examples
● Negation Example
– “There is no focal infiltrate or pleural effusion.”
– --negex output(in addition to normal output):
NEGATIONS:
Negation Type:nega
Negation Trigger: no
Negation PosInfo: 9/2
Negated Concept: C0332448:Infiltrate
Concept PosInfo: 18/10
Negation Type:nega
Negation Trigger: no
Negation PosInfo: 9/2
Negated Concept: C2073625:pleural effusion, C0032227:Pleural Effusion
Concept PosInfo: 32/16
Other Options
●
-@ --WSD <hostname> : Which WSD server to use.
●
-8 --dynamic_variant_generation : dynamic variant generation
●
-D --all_derivational_variants : all derivational variants
●
-J --restrict_to_sts <semtypelist> : restrict to semantic types
●
-K --ignore_stop_phrases : ignore stop phrases.
●
-R --restrict_to_sources <sourcelist> : restrict to sources
●
-V --mm_data_version <name> : version of MetaMap data to use.
●
-X --truncate_candidates_mappings : truncate candidates mapping
●
-Y --prefer_multiple_concepts : prefer multiple concepts
●
-Z --mm_data_year <name> : year of MetaMap data to use.
●
-a --all_acros_abbrs : allow Acronym/Abbreviation variants
●
-b --compute_all_mappings : compute/display all mappings
●
-d --no_derivational_variants : no derivational variants
●
-e --exclude_sources <sourcelist> : exclude semantic types
●
-g --allow_concept_gaps : allow concept gaps
● -i --ignore_word_order : ignore word order
●
-k --exclude_sts <semtypelist> : exclude semantic types
●
-o --allow_overmatches : allow overmatches
●
-r --threshold <integer> : Threshold for displaying candidates.
●
MetaMap Output Formats
● Human-readable outputp
● MetaMap Machine Output (MMO)
● XML output
● Colorized MetaMap output (MetaMap 3D)
● Fielded (MMI) Outputs
Human Readable
Phrase: "heart attack"
Meta Candidates (8):
1000 Heart attack (Myocardial Infarction) [Disease or Syndrome]
861 Heart [Body Part, Organ, or Organ Component]
861 Attack, NOS (Onset of illness) [Finding]
861 Attack (Attack device) [Medical Device]
861 attack (Attack behavior) [Social Behavior]
861 Heart (Entire heart) [Body Part, Organ, or Organ Component]
861 Attack (Observation of attack) [Finding]
827 Attacked (Assault) [Injury or Poisoning]
Meta Mapping (1000):
1000 Heart attack (Myocardial Infarction) [Disease or Syndrome]
Machine Output
candidates([
ev(-1000, 'C0027051', 'Heart attack', 'Myocardial Infarction', [heart,attack], [dsyn],
[[[1,2],[1,2],0]], yes, no, ['MEDLINEPLUS], [0/12]),
ev(-861, 'C0018787', 'Heart', 'Heart', [heart], [bpoc], [[[1,1],[1,1],0]], yes, no, ['AIR'],[0/5]),
ev(-861, 'C0277793', 'Attack, NOS', 'Onset of illness', [attack], [fndg], [[[2,2],[1,1],0]],
yes, no, ['MTH'], [6/6]),
ev(-861, 'C0699795', 'Attack', 'Attack device', [attack], [medd] [[[2[medd],[[[2,2],
[1,1],0]],2] [1 1] 0]] yesyes, nono, ['MTH'[ MTH ,'MMSL']MMSL ], [6/6])[6/6]),
ev(-861, 'C1261512', attack, 'Attack behavior', [attack],[socb], [[[2,2],[1,1],0]], yes, no,
['MTH','PSY','AOD'], [6/6]),
ev(-861, 'C1281570', 'Heart', 'Entire heart', [heart], [bpoc], [[[1,1],[1,1],0]], yes, no,
['MTH','SNOMEDCT'], [0/5]),
Ev(-861, , 'C1304680',, 'Attack',, 'Observation of attack',, [attack],,[fndg], [[[2,2],
[1,1],0]],yes, no, ['MTH','SNOMEDCT'], [6/6]),
ev(-827, 'C0004063', 'Attacked', 'Assault', [attacked], [inpo], [[[2,2],[1,1],1]], yes, no,
['ICD10AM'], [6/6])]).
Unformatted XML
<Candidate><CandidateScore>-
1000</CandidateScore><CandidateCUI>C0027051</CandidateCUI><CandidateM
atched>Heart attack</CandidateMatched><CandidatePreferred>Myocardial Infarction</CandidatePreferr
ed><MatchedWords
Count=2><MatchedWord>heart</MatchedWord><MatchedWord>attack</MatchedWord></Match
edWords><SemTypes Count=1><SemType>dsyn</SemType></SemTypes><MatchMaps
Count=1><MatchMap><TextMat
chStart>1</TextMatchStart><TextMatchEnd>2</TextMatchEnd><ConcMatchStart>1</ConcMatchStart><C
oncMa
tchEnd>2</ConcMatchEnd><LexVariation>0</LexVariation></MatchMap></MatchMaps><IsHead>yes</Is
Head><
IsOverMatch>no</IsOverMatch><Sources
Count=24><Source>MEDLINEPLUS</Source></Sources><ConceptPIs C
ount=1><ConceptPI><StartPos>0</StartPos><Length>12</Length></ConceptPI></ConceptPIs></Candid
ate>
Formatted XML
<Candidate>
<CandidateScore>-1000</CandidateScore>
<CandidateCUI>C0027051</CandidateCUI>
<CandidateMatched>Heart attack</CandidateMatched>
<CandidatePreferred>Myocardial Infarction</CandidatePreferred>
<MatchedWords
Count=2><MatchedWord>heart</MatchedWord><MatchedWord>attack</MatchedWord></MatchedWords>
<SemTypes>
<Count=1><SemType>dsyn</SemType></SemTypes>
<MatchMaps Count=1>
<MatchMap>
<TextMatchStart>1</TextMatchStart>
<ConcMatchEnd>2</ConcMatchEnd>
<LexVariation>0</LexVariation>
</MatchMap>
</MatchMaps>
<IsHead>yes</IsHead>
<IsOverMatch>no</IsOverMatch>
<Sources Count=24><Source>MEDLINEPLUS</Source></Sources>
<ConceptPIs Count=1><ConceptPI><StartPos>0</StartPos><Length>12</Length></ConceptPI></ConceptPIs>
</Candidate>
MetaMap 3D
MetaMap: Technical Aspect
●
Download
–MetaMap API Underlying Architecture.
–MetaMap Java API.
●Extract and Install
–$ bzip2 -dc public_mm_linux_javaapi_{four-digit-year}.tar.bz2 | tar xvf -
–$ ./bin/install.sh
●
Starting MetaMap Server
$ ./bin/skrmedpostctl start #Start SKR Server
$ ./bin/wsdserverctl start #Start WSD Server (Optional)
$ ./bin/mmserver{two-digit-year} #Start MetaMap Server
MetaMap Java API
Two jar files contain the API:
✔ /src/javaapi/dist/MetaMapApi.jar
✔ /src/javaapi/dist/prologbeans.jar
Code Time :)
MetaMapApi api = new MetaMapApiImpl("localhost");
List<Result> resultList =
api.processCitationsFromFile("Abstract.txt");
Result result = resultList.get(0);
Code Time :)
for (Utterance utterance: result.getUtteranceList()) {
System.out.println("Utterance:");
System.out.println(" Id: " + utterance.getId());
System.out.println(" Utterance text: " + utterance.getString());
System.out.println(" Position: " + utterance.getPosition());
Code Time :)
for (PCM pcm: utterance.getPCMList()) {
System.out.println("Phrase:");
System.out.println(" text: " + pcm.getPhrase().getPhraseText());
System.out.println("Candidates:");
for (Ev ev: pcm.getCandidateList()) {
System.out.println(" Candidate:");
System.out.println(" Score: " + ev.getScore());
System.out.println(" Concept Id: " + ev.getConceptId());
System.out.println(" Concept Name: " + ev.getConceptName());
System.out.println(" Preferred Name: " + ev.getPreferredName());
System.out.println(" Matched Words: " + ev.getMatchedWords());
System.out.println(" Semantic Types: " + ev.getSemanticTypes());
System.out.println(" MatchMap: " + ev.getMatchMap());
System.out.println(" MatchMap alt. repr.: " + ev.getMatchMapList());
System.out.println(" is Head?: " + ev.isHead());
System.out.println(" is Overmatch?: " + ev.isOvermatch());
System.out.println(" Sources: " + ev.getSources());
System.out.println(" Positional Info: " + ev.getPositionalInfo());
}
Code Time :)
System.out.println("Mappings:");
for (Mapping map: pcm.getMappingList()) {
System.out.println(" Map Score: " + map.getScore());
for (Ev mapEv: map.getEvList()) {
System.out.println(" Score: " + mapEv.getScore());
System.out.println(" Concept Id: " + mapEv.getConceptId());
System.out.println(" Concept Name: " + mapEv.getConceptName());
System.out.println(" Preferred Name: " + mapEv.getPreferredName());
System.out.println(" Matched Words: " + mapEv.getMatchedWords());
System.out.println(" Semantic Types: " + mapEv.getSemanticTypes());
System.out.println(" MatchMap: " + mapEv.getMatchMap());
System.out.println(" MatchMap alt. repr.: " + mapEv.getMatchMapList());
System.out.println(" is Head?: " + mapEv.isHead());
System.out.println(" is Overmatch?: " + mapEv.isOvermatch());
System.out.println(" Sources: " + mapEv.getSources());
System.out.println(" Positional Info: " + mapEv.getPositionalInfo());
}
}
}
}

More Related Content

What's hot

Introduction to Named Entity Recognition
Introduction to Named Entity RecognitionIntroduction to Named Entity Recognition
Introduction to Named Entity RecognitionTomer Lieber
 
Introduction to text classification using naive bayes
Introduction to text classification using naive bayesIntroduction to text classification using naive bayes
Introduction to text classification using naive bayesDhwaj Raj
 
Bayesian Networks - A Brief Introduction
Bayesian Networks - A Brief IntroductionBayesian Networks - A Brief Introduction
Bayesian Networks - A Brief IntroductionAdnan Masood
 
Machine Learning - Accuracy and Confusion Matrix
Machine Learning - Accuracy and Confusion MatrixMachine Learning - Accuracy and Confusion Matrix
Machine Learning - Accuracy and Confusion MatrixAndrew Ferlitsch
 
Inference in Bayesian Networks
Inference in Bayesian NetworksInference in Bayesian Networks
Inference in Bayesian Networksguestfee8698
 
Machine Learning Performance metrics for classification
Machine Learning Performance metrics for classificationMachine Learning Performance metrics for classification
Machine Learning Performance metrics for classificationKuppusamy P
 
Introdução a Redes Neurais Recorrentes
Introdução a Redes Neurais RecorrentesIntrodução a Redes Neurais Recorrentes
Introdução a Redes Neurais RecorrentesMarlesson Santana
 
Prolog Programming Language
Prolog Programming  LanguageProlog Programming  Language
Prolog Programming LanguageReham AlBlehid
 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveySOYEON KIM
 
Information Extraction
Information ExtractionInformation Extraction
Information Extractionssbd6985
 

What's hot (20)

Hmm
HmmHmm
Hmm
 
Introduction to Named Entity Recognition
Introduction to Named Entity RecognitionIntroduction to Named Entity Recognition
Introduction to Named Entity Recognition
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Introduction to text classification using naive bayes
Introduction to text classification using naive bayesIntroduction to text classification using naive bayes
Introduction to text classification using naive bayes
 
Bayesian Networks - A Brief Introduction
Bayesian Networks - A Brief IntroductionBayesian Networks - A Brief Introduction
Bayesian Networks - A Brief Introduction
 
Machine Learning - Accuracy and Confusion Matrix
Machine Learning - Accuracy and Confusion MatrixMachine Learning - Accuracy and Confusion Matrix
Machine Learning - Accuracy and Confusion Matrix
 
Bayesian networks
Bayesian networksBayesian networks
Bayesian networks
 
K Nearest Neighbors
K Nearest NeighborsK Nearest Neighbors
K Nearest Neighbors
 
Inference in Bayesian Networks
Inference in Bayesian NetworksInference in Bayesian Networks
Inference in Bayesian Networks
 
Naive bayes
Naive bayesNaive bayes
Naive bayes
 
Decision tree
Decision treeDecision tree
Decision tree
 
DETR ECCV20
DETR ECCV20DETR ECCV20
DETR ECCV20
 
Hidden markov model ppt
Hidden markov model pptHidden markov model ppt
Hidden markov model ppt
 
Machine Learning Performance metrics for classification
Machine Learning Performance metrics for classificationMachine Learning Performance metrics for classification
Machine Learning Performance metrics for classification
 
Introdução a Redes Neurais Recorrentes
Introdução a Redes Neurais RecorrentesIntrodução a Redes Neurais Recorrentes
Introdução a Redes Neurais Recorrentes
 
Prolog Programming Language
Prolog Programming  LanguageProlog Programming  Language
Prolog Programming Language
 
AI: Planning and AI
AI: Planning and AIAI: Planning and AI
AI: Planning and AI
 
HIDDEN MARKOV MODEL AND ITS APPLICATION
HIDDEN MARKOV MODEL AND ITS APPLICATIONHIDDEN MARKOV MODEL AND ITS APPLICATION
HIDDEN MARKOV MODEL AND ITS APPLICATION
 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a survey
 
Information Extraction
Information ExtractionInformation Extraction
Information Extraction
 

Viewers also liked

VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...
VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...
VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...Janos Hajagos
 
UMLS as a Resource for Virtual Data Warehouse Infrastructure KEY
UMLS as a Resource for Virtual Data Warehouse Infrastructure KEYUMLS as a Resource for Virtual Data Warehouse Infrastructure KEY
UMLS as a Resource for Virtual Data Warehouse Infrastructure KEYHMO Research Network
 
IRIT at clef 2015: A product search model for head queries
IRIT at clef 2015: A product search model for head queriesIRIT at clef 2015: A product search model for head queries
IRIT at clef 2015: A product search model for head queriesLamjed Ben Jabeur
 
Thesis slides - Definition and evluation of collaborative information retriev...
Thesis slides - Definition and evluation of collaborative information retriev...Thesis slides - Definition and evluation of collaborative information retriev...
Thesis slides - Definition and evluation of collaborative information retriev...UPMC - Sorbonne Universities
 
Information Retrieval UMLS
Information Retrieval UMLSInformation Retrieval UMLS
Information Retrieval UMLSsiddarajuss
 
Information Retrieval
Information RetrievalInformation Retrieval
Information RetrievalPhuong Pham
 
Description and retrieval of medical visual information based on language mod...
Description and retrieval of medical visual information based on language mod...Description and retrieval of medical visual information based on language mod...
Description and retrieval of medical visual information based on language mod...Antonio Foncubierta Rodriguez
 
Health IT & Voice of Patient
Health IT & Voice of PatientHealth IT & Voice of Patient
Health IT & Voice of PatientDr.Mahmoud Abbas
 
2. Triết học MBA (LS triết học Ấn Độ)
2. Triết học MBA (LS triết học Ấn Độ)2. Triết học MBA (LS triết học Ấn Độ)
2. Triết học MBA (LS triết học Ấn Độ)Hưng, Đinh Duy
 
Medical Information Retrieval and its Evaluation: an Overview of CLEF eHealth...
Medical Information Retrieval and its Evaluation: an Overview of CLEF eHealth...Medical Information Retrieval and its Evaluation: an Overview of CLEF eHealth...
Medical Information Retrieval and its Evaluation: an Overview of CLEF eHealth...lorrainegoeuriot
 
Challenges of managing Data Science Project
Challenges of managing Data Science ProjectChallenges of managing Data Science Project
Challenges of managing Data Science ProjectLamjed Ben Jabeur
 
Quels facteurs de pertinence pour la recherche de produits e-commerce ?
Quels facteurs de pertinence pour la recherche de produits e-commerce ?Quels facteurs de pertinence pour la recherche de produits e-commerce ?
Quels facteurs de pertinence pour la recherche de produits e-commerce ?Lamjed Ben Jabeur
 
Introduction au web sémantique : quand le lient fait sens
Introduction au web sémantique : quand le lient fait sensIntroduction au web sémantique : quand le lient fait sens
Introduction au web sémantique : quand le lient fait sensFICEL Hemza
 
Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsMounia Lalmas-Roelleke
 
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...Lamjed Ben Jabeur
 
Définition et évaluation de modèles d'agrégation pour l'estimation de la pert...
Définition et évaluation de modèles d'agrégation pour l'estimation de la pert...Définition et évaluation de modèles d'agrégation pour l'estimation de la pert...
Définition et évaluation de modèles d'agrégation pour l'estimation de la pert...Bilel Moulahi
 
Aussenac confinvitéeic2014 histoire ic25ans
Aussenac confinvitéeic2014 histoire ic25ansAussenac confinvitéeic2014 histoire ic25ans
Aussenac confinvitéeic2014 histoire ic25ansNathalie Aussenac-Gilles
 

Viewers also liked (20)

121129 umls yes
121129 umls yes121129 umls yes
121129 umls yes
 
VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...
VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...
VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...
 
UMLS as a Resource for Virtual Data Warehouse Infrastructure KEY
UMLS as a Resource for Virtual Data Warehouse Infrastructure KEYUMLS as a Resource for Virtual Data Warehouse Infrastructure KEY
UMLS as a Resource for Virtual Data Warehouse Infrastructure KEY
 
IRIT at clef 2015: A product search model for head queries
IRIT at clef 2015: A product search model for head queriesIRIT at clef 2015: A product search model for head queries
IRIT at clef 2015: A product search model for head queries
 
Thesis slides - Definition and evluation of collaborative information retriev...
Thesis slides - Definition and evluation of collaborative information retriev...Thesis slides - Definition and evluation of collaborative information retriev...
Thesis slides - Definition and evluation of collaborative information retriev...
 
Information Retrieval UMLS
Information Retrieval UMLSInformation Retrieval UMLS
Information Retrieval UMLS
 
Information Retrieval
Information RetrievalInformation Retrieval
Information Retrieval
 
Description and retrieval of medical visual information based on language mod...
Description and retrieval of medical visual information based on language mod...Description and retrieval of medical visual information based on language mod...
Description and retrieval of medical visual information based on language mod...
 
Health IT & Voice of Patient
Health IT & Voice of PatientHealth IT & Voice of Patient
Health IT & Voice of Patient
 
2. Triết học MBA (LS triết học Ấn Độ)
2. Triết học MBA (LS triết học Ấn Độ)2. Triết học MBA (LS triết học Ấn Độ)
2. Triết học MBA (LS triết học Ấn Độ)
 
Medical Information Retrieval and its Evaluation: an Overview of CLEF eHealth...
Medical Information Retrieval and its Evaluation: an Overview of CLEF eHealth...Medical Information Retrieval and its Evaluation: an Overview of CLEF eHealth...
Medical Information Retrieval and its Evaluation: an Overview of CLEF eHealth...
 
Challenges of managing Data Science Project
Challenges of managing Data Science ProjectChallenges of managing Data Science Project
Challenges of managing Data Science Project
 
Quels facteurs de pertinence pour la recherche de produits e-commerce ?
Quels facteurs de pertinence pour la recherche de produits e-commerce ?Quels facteurs de pertinence pour la recherche de produits e-commerce ?
Quels facteurs de pertinence pour la recherche de produits e-commerce ?
 
Ontology engineering
Ontology engineering Ontology engineering
Ontology engineering
 
Introduction au web sémantique : quand le lient fait sens
Introduction au web sémantique : quand le lient fait sensIntroduction au web sémantique : quand le lient fait sens
Introduction au web sémantique : quand le lient fait sens
 
Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & Models
 
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
 
Définition et évaluation de modèles d'agrégation pour l'estimation de la pert...
Définition et évaluation de modèles d'agrégation pour l'estimation de la pert...Définition et évaluation de modèles d'agrégation pour l'estimation de la pert...
Définition et évaluation de modèles d'agrégation pour l'estimation de la pert...
 
Aussenac confinvitéeic2014 histoire ic25ans
Aussenac confinvitéeic2014 histoire ic25ansAussenac confinvitéeic2014 histoire ic25ans
Aussenac confinvitéeic2014 histoire ic25ans
 
Aussenac ri ia-2015
Aussenac ri ia-2015Aussenac ri ia-2015
Aussenac ri ia-2015
 

Similar to Unified Medical Language System & MetaMap

The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologySnow Owl
 
[Review] High-performance medicine: the convergence of human and artificial i...
[Review] High-performance medicine: the convergence of human and artificial i...[Review] High-performance medicine: the convergence of human and artificial i...
[Review] High-performance medicine: the convergence of human and artificial i...Dongmin Choi
 
Clinical data successes using machine learning (word2vec, RNN)
Clinical data successes using machine learning (word2vec, RNN)Clinical data successes using machine learning (word2vec, RNN)
Clinical data successes using machine learning (word2vec, RNN)Joseph Paul Cohen PhD
 
Natural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsNatural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsMMS Holdings
 
Fire and Ice - SNOMED for CORE strength - John Fountain
Fire and Ice - SNOMED for CORE strength - John FountainFire and Ice - SNOMED for CORE strength - John Fountain
Fire and Ice - SNOMED for CORE strength - John FountainHL7 New Zealand
 
Exhaustive Literature Searching (Systematic Reviews)
Exhaustive Literature Searching (Systematic Reviews)Exhaustive Literature Searching (Systematic Reviews)
Exhaustive Literature Searching (Systematic Reviews)markmac
 
A Quasi Relational Query Language for Persistent Standardized EHRs: Using NoS...
A Quasi Relational Query Language for Persistent Standardized EHRs: Using NoS...A Quasi Relational Query Language for Persistent Standardized EHRs: Using NoS...
A Quasi Relational Query Language for Persistent Standardized EHRs: Using NoS...Aastha Madaan
 
Deriving an ICU Subset of SNOMED CT from Clinical Notes
Deriving an ICU Subset of SNOMED CT from Clinical NotesDeriving an ICU Subset of SNOMED CT from Clinical Notes
Deriving an ICU Subset of SNOMED CT from Clinical NotesHealth Informatics New Zealand
 
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...Matthias Samwald
 
Iot analytics in wearables
Iot analytics in wearables Iot analytics in wearables
Iot analytics in wearables Jessica Willis
 
PhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco RuizPhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco RuizLuis Marco Ruiz
 
Using NLP and curation to make clinical data available for research
Using NLP and curation to make clinical data available for researchUsing NLP and curation to make clinical data available for research
Using NLP and curation to make clinical data available for researchWarren Kibbe
 
Prevalence Of Pressure Ulcer Name xxxUnited State Universit.docx
Prevalence Of  Pressure Ulcer Name xxxUnited State Universit.docxPrevalence Of  Pressure Ulcer Name xxxUnited State Universit.docx
Prevalence Of Pressure Ulcer Name xxxUnited State Universit.docxLacieKlineeb
 
Implementation and Use of ISO EN 13606 and openEHR
Implementation and Use of ISO EN 13606 and openEHRImplementation and Use of ISO EN 13606 and openEHR
Implementation and Use of ISO EN 13606 and openEHRKoray Atalag
 
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...Kees van Bochove
 
Exploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsExploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsNigel Collier
 
Elsevier Medical Graph – mit Machine Learning zu Precision Medicine
Elsevier Medical Graph – mit Machine Learning zu Precision MedicineElsevier Medical Graph – mit Machine Learning zu Precision Medicine
Elsevier Medical Graph – mit Machine Learning zu Precision MedicineRising Media Ltd.
 

Similar to Unified Medical Language System & MetaMap (20)

The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to Terminology
 
svm_AD
svm_ADsvm_AD
svm_AD
 
[Review] High-performance medicine: the convergence of human and artificial i...
[Review] High-performance medicine: the convergence of human and artificial i...[Review] High-performance medicine: the convergence of human and artificial i...
[Review] High-performance medicine: the convergence of human and artificial i...
 
Clinical data successes using machine learning (word2vec, RNN)
Clinical data successes using machine learning (word2vec, RNN)Clinical data successes using machine learning (word2vec, RNN)
Clinical data successes using machine learning (word2vec, RNN)
 
Natural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsNatural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health Records
 
Fire and Ice - SNOMED for CORE strength - John Fountain
Fire and Ice - SNOMED for CORE strength - John FountainFire and Ice - SNOMED for CORE strength - John Fountain
Fire and Ice - SNOMED for CORE strength - John Fountain
 
Exhaustive Literature Searching (Systematic Reviews)
Exhaustive Literature Searching (Systematic Reviews)Exhaustive Literature Searching (Systematic Reviews)
Exhaustive Literature Searching (Systematic Reviews)
 
SNOMED CT
SNOMED CTSNOMED CT
SNOMED CT
 
A Quasi Relational Query Language for Persistent Standardized EHRs: Using NoS...
A Quasi Relational Query Language for Persistent Standardized EHRs: Using NoS...A Quasi Relational Query Language for Persistent Standardized EHRs: Using NoS...
A Quasi Relational Query Language for Persistent Standardized EHRs: Using NoS...
 
How to use terminology serves from Clinical Work Stations
How to use terminology serves from Clinical Work StationsHow to use terminology serves from Clinical Work Stations
How to use terminology serves from Clinical Work Stations
 
Deriving an ICU Subset of SNOMED CT from Clinical Notes
Deriving an ICU Subset of SNOMED CT from Clinical NotesDeriving an ICU Subset of SNOMED CT from Clinical Notes
Deriving an ICU Subset of SNOMED CT from Clinical Notes
 
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
 
Iot analytics in wearables
Iot analytics in wearables Iot analytics in wearables
Iot analytics in wearables
 
PhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco RuizPhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco Ruiz
 
Using NLP and curation to make clinical data available for research
Using NLP and curation to make clinical data available for researchUsing NLP and curation to make clinical data available for research
Using NLP and curation to make clinical data available for research
 
Prevalence Of Pressure Ulcer Name xxxUnited State Universit.docx
Prevalence Of  Pressure Ulcer Name xxxUnited State Universit.docxPrevalence Of  Pressure Ulcer Name xxxUnited State Universit.docx
Prevalence Of Pressure Ulcer Name xxxUnited State Universit.docx
 
Implementation and Use of ISO EN 13606 and openEHR
Implementation and Use of ISO EN 13606 and openEHRImplementation and Use of ISO EN 13606 and openEHR
Implementation and Use of ISO EN 13606 and openEHR
 
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
 
Exploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsExploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease Informatics
 
Elsevier Medical Graph – mit Machine Learning zu Precision Medicine
Elsevier Medical Graph – mit Machine Learning zu Precision MedicineElsevier Medical Graph – mit Machine Learning zu Precision Medicine
Elsevier Medical Graph – mit Machine Learning zu Precision Medicine
 

Recently uploaded

FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 

Recently uploaded (20)

FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 

Unified Medical Language System & MetaMap

  • 1. Mapping biomedical literature into UMLS concepts MetaMap Presented By: Osama Jomaa Miami University
  • 3. Motivation “... to facilitate the development of computer systems that behave as if they "understand" the meaning of the language of biomedicine and health.” National Library of Medicine
  • 4. UMLS Components 1.Metathesaurus +1 Million biomedical concepts from over 100 vocabularies 2..Semantic Network 133 categories & 54 relationships. 3..Specialist Lexicon & Lexical Tools Software programs to aid in NLP
  • 5. Meta thesaurus Patient Care Controlled Terms Biomedical Vocabs from Different Languages Clinical/Health Services Research Health Services Billing Biomedical Literature Catalogs Public Health Statistics . . . . . 5,000,000biomedicalterm 1,000,000Concepts + 100 Source Vocabs Relational DB Tables
  • 6. Metathesaurus ●Concepts are classified into categories: –Diagnosis –Procedures & Supplies –Diseases –…. ●Concepts have unique identifier. ●Concepts have preferred terms. ●Concepts can be grouped into subsets via applying filters.
  • 8. One Concept Many Terms One concept can have many terms in multiple vocabularies. Example: Atrial Fibrillation
  • 10. Unique Identifiers ● Concept Unique Identifier (CUI) Link all the names in all the source vocabs that mean the same to one concept and assign a unique identifier, CUI, to it. ● Lexical Unique Identifier (LUI) Are lexical variants for the concepts detected using Lexical Variant Generator (LVG) program. ● String Unique Identifier (SUI) Represents variations in the char set, upper-lower case, or permutation difference. ● Atom Unique Identifier (AUI) Every occurrence of a string in each source vocab is assigned a unique identifier, AUI.
  • 11.
  • 12. Semantic Network ● Semantic Types +133 types, each MT concept assigned one semantic type at least. ● Semantic Relationships 54 relationaship. Is-A is the most important.
  • 13. Semantic Network Semantic Types Examples: ✔ Organisms ✔ Anatomical structures ✔ Biologic function ✔ Chemicals ✔ Physical objects Entity Event Semantic Relationships Examples: ✔ Physically related to ✔ Spatially related to ✔ Temporally related to ✔ Functionally related to ✔ Conceptually related to
  • 14. Lexical Tools ●The Specialist Lexicon Is an English lexicon (dictionary) that includes over 200,000 biomedical terms from a variety of source to aid in NLP. ●Lexical Variant Generator (LVG) ●Norm Normalizer ●Wordind Tokenizer
  • 16. Why Concept Identification? ● Information extraction/Data mining ● Classification/Categorization ● Text summarization ● Question answering ● Literature-based Knowledge Discovery
  • 17. Example Phrase: “lung cancer.” Meta Candidates (8): 1000 Lung Cancer {MDR,DXP} (Malignant neoplasm of lung) [Neoplastic Process] 1000 Lung Cancer (Carcinoma of lung) [Neoplastic Process] 861 Cancer (Malignant Neoplasms) [Neoplastic Process] 861 Lung [Body Part, Organ, or Organ Component] 861 Cancer (Cancer Genus) [Invertebrate] 861 Lung (Entire lung) [Body Part, Organ, or Organ Component] 861 Cancer (Specialty Type - cancer) [Biomedical Occupation or Discipline] 768 Pneumonia [Disease or Syndrome] Meta Mapping (1000): 1000 Lung Cancer (Carcinoma of lung) [Neoplastic Process] Meta Mapping (1000): 1000 Lung Cancer (Malignant neoplasm of lung) [Neoplastic Process]
  • 19. MetaMap Options ● Word Sense Disambiguation (-y) Determines which concept is the best choice using surrounding context. ● Negation (--negx) Identifies negated entities.
  • 20. Examples ● WSD Examples –“Fifteen (6.4%) of 234 colds treated with placebo ..” ● Cold (cold temperature) [npop] ● Cold (Common cold) [dsyn] ● Cold (Cold Sensation) [phsf] –“.. the drugs were compared in two four-point, double-blind bioassays.” ● Double (Diplopia) [dsyn] vs. Double (Duplicate) [ftcn] ● Blind (Blind Vision) [dsyn] vs. BLIND (Blinded) [reasa] vs. Blind (Visually impaired persons) [podg] ● Bioassays (Biological Assay) [lbpr]
  • 21. Examples ● Negation Example – “There is no focal infiltrate or pleural effusion.” – --negex output(in addition to normal output): NEGATIONS: Negation Type:nega Negation Trigger: no Negation PosInfo: 9/2 Negated Concept: C0332448:Infiltrate Concept PosInfo: 18/10 Negation Type:nega Negation Trigger: no Negation PosInfo: 9/2 Negated Concept: C2073625:pleural effusion, C0032227:Pleural Effusion Concept PosInfo: 32/16
  • 22. Other Options ● -@ --WSD <hostname> : Which WSD server to use. ● -8 --dynamic_variant_generation : dynamic variant generation ● -D --all_derivational_variants : all derivational variants ● -J --restrict_to_sts <semtypelist> : restrict to semantic types ● -K --ignore_stop_phrases : ignore stop phrases. ● -R --restrict_to_sources <sourcelist> : restrict to sources ● -V --mm_data_version <name> : version of MetaMap data to use. ● -X --truncate_candidates_mappings : truncate candidates mapping ● -Y --prefer_multiple_concepts : prefer multiple concepts ● -Z --mm_data_year <name> : year of MetaMap data to use. ● -a --all_acros_abbrs : allow Acronym/Abbreviation variants ● -b --compute_all_mappings : compute/display all mappings ● -d --no_derivational_variants : no derivational variants ● -e --exclude_sources <sourcelist> : exclude semantic types ● -g --allow_concept_gaps : allow concept gaps ● -i --ignore_word_order : ignore word order ● -k --exclude_sts <semtypelist> : exclude semantic types ● -o --allow_overmatches : allow overmatches ● -r --threshold <integer> : Threshold for displaying candidates. ●
  • 23. MetaMap Output Formats ● Human-readable outputp ● MetaMap Machine Output (MMO) ● XML output ● Colorized MetaMap output (MetaMap 3D) ● Fielded (MMI) Outputs
  • 24. Human Readable Phrase: "heart attack" Meta Candidates (8): 1000 Heart attack (Myocardial Infarction) [Disease or Syndrome] 861 Heart [Body Part, Organ, or Organ Component] 861 Attack, NOS (Onset of illness) [Finding] 861 Attack (Attack device) [Medical Device] 861 attack (Attack behavior) [Social Behavior] 861 Heart (Entire heart) [Body Part, Organ, or Organ Component] 861 Attack (Observation of attack) [Finding] 827 Attacked (Assault) [Injury or Poisoning] Meta Mapping (1000): 1000 Heart attack (Myocardial Infarction) [Disease or Syndrome]
  • 25. Machine Output candidates([ ev(-1000, 'C0027051', 'Heart attack', 'Myocardial Infarction', [heart,attack], [dsyn], [[[1,2],[1,2],0]], yes, no, ['MEDLINEPLUS], [0/12]), ev(-861, 'C0018787', 'Heart', 'Heart', [heart], [bpoc], [[[1,1],[1,1],0]], yes, no, ['AIR'],[0/5]), ev(-861, 'C0277793', 'Attack, NOS', 'Onset of illness', [attack], [fndg], [[[2,2],[1,1],0]], yes, no, ['MTH'], [6/6]), ev(-861, 'C0699795', 'Attack', 'Attack device', [attack], [medd] [[[2[medd],[[[2,2], [1,1],0]],2] [1 1] 0]] yesyes, nono, ['MTH'[ MTH ,'MMSL']MMSL ], [6/6])[6/6]), ev(-861, 'C1261512', attack, 'Attack behavior', [attack],[socb], [[[2,2],[1,1],0]], yes, no, ['MTH','PSY','AOD'], [6/6]), ev(-861, 'C1281570', 'Heart', 'Entire heart', [heart], [bpoc], [[[1,1],[1,1],0]], yes, no, ['MTH','SNOMEDCT'], [0/5]), Ev(-861, , 'C1304680',, 'Attack',, 'Observation of attack',, [attack],,[fndg], [[[2,2], [1,1],0]],yes, no, ['MTH','SNOMEDCT'], [6/6]), ev(-827, 'C0004063', 'Attacked', 'Assault', [attacked], [inpo], [[[2,2],[1,1],1]], yes, no, ['ICD10AM'], [6/6])]).
  • 26. Unformatted XML <Candidate><CandidateScore>- 1000</CandidateScore><CandidateCUI>C0027051</CandidateCUI><CandidateM atched>Heart attack</CandidateMatched><CandidatePreferred>Myocardial Infarction</CandidatePreferr ed><MatchedWords Count=2><MatchedWord>heart</MatchedWord><MatchedWord>attack</MatchedWord></Match edWords><SemTypes Count=1><SemType>dsyn</SemType></SemTypes><MatchMaps Count=1><MatchMap><TextMat chStart>1</TextMatchStart><TextMatchEnd>2</TextMatchEnd><ConcMatchStart>1</ConcMatchStart><C oncMa tchEnd>2</ConcMatchEnd><LexVariation>0</LexVariation></MatchMap></MatchMaps><IsHead>yes</Is Head>< IsOverMatch>no</IsOverMatch><Sources Count=24><Source>MEDLINEPLUS</Source></Sources><ConceptPIs C ount=1><ConceptPI><StartPos>0</StartPos><Length>12</Length></ConceptPI></ConceptPIs></Candid ate>
  • 27. Formatted XML <Candidate> <CandidateScore>-1000</CandidateScore> <CandidateCUI>C0027051</CandidateCUI> <CandidateMatched>Heart attack</CandidateMatched> <CandidatePreferred>Myocardial Infarction</CandidatePreferred> <MatchedWords Count=2><MatchedWord>heart</MatchedWord><MatchedWord>attack</MatchedWord></MatchedWords> <SemTypes> <Count=1><SemType>dsyn</SemType></SemTypes> <MatchMaps Count=1> <MatchMap> <TextMatchStart>1</TextMatchStart> <ConcMatchEnd>2</ConcMatchEnd> <LexVariation>0</LexVariation> </MatchMap> </MatchMaps> <IsHead>yes</IsHead> <IsOverMatch>no</IsOverMatch> <Sources Count=24><Source>MEDLINEPLUS</Source></Sources> <ConceptPIs Count=1><ConceptPI><StartPos>0</StartPos><Length>12</Length></ConceptPI></ConceptPIs> </Candidate>
  • 29. MetaMap: Technical Aspect ● Download –MetaMap API Underlying Architecture. –MetaMap Java API. ●Extract and Install –$ bzip2 -dc public_mm_linux_javaapi_{four-digit-year}.tar.bz2 | tar xvf - –$ ./bin/install.sh ● Starting MetaMap Server $ ./bin/skrmedpostctl start #Start SKR Server $ ./bin/wsdserverctl start #Start WSD Server (Optional) $ ./bin/mmserver{two-digit-year} #Start MetaMap Server
  • 30. MetaMap Java API Two jar files contain the API: ✔ /src/javaapi/dist/MetaMapApi.jar ✔ /src/javaapi/dist/prologbeans.jar
  • 31. Code Time :) MetaMapApi api = new MetaMapApiImpl("localhost"); List<Result> resultList = api.processCitationsFromFile("Abstract.txt"); Result result = resultList.get(0);
  • 32. Code Time :) for (Utterance utterance: result.getUtteranceList()) { System.out.println("Utterance:"); System.out.println(" Id: " + utterance.getId()); System.out.println(" Utterance text: " + utterance.getString()); System.out.println(" Position: " + utterance.getPosition());
  • 33. Code Time :) for (PCM pcm: utterance.getPCMList()) { System.out.println("Phrase:"); System.out.println(" text: " + pcm.getPhrase().getPhraseText()); System.out.println("Candidates:"); for (Ev ev: pcm.getCandidateList()) { System.out.println(" Candidate:"); System.out.println(" Score: " + ev.getScore()); System.out.println(" Concept Id: " + ev.getConceptId()); System.out.println(" Concept Name: " + ev.getConceptName()); System.out.println(" Preferred Name: " + ev.getPreferredName()); System.out.println(" Matched Words: " + ev.getMatchedWords()); System.out.println(" Semantic Types: " + ev.getSemanticTypes()); System.out.println(" MatchMap: " + ev.getMatchMap()); System.out.println(" MatchMap alt. repr.: " + ev.getMatchMapList()); System.out.println(" is Head?: " + ev.isHead()); System.out.println(" is Overmatch?: " + ev.isOvermatch()); System.out.println(" Sources: " + ev.getSources()); System.out.println(" Positional Info: " + ev.getPositionalInfo()); }
  • 34. Code Time :) System.out.println("Mappings:"); for (Mapping map: pcm.getMappingList()) { System.out.println(" Map Score: " + map.getScore()); for (Ev mapEv: map.getEvList()) { System.out.println(" Score: " + mapEv.getScore()); System.out.println(" Concept Id: " + mapEv.getConceptId()); System.out.println(" Concept Name: " + mapEv.getConceptName()); System.out.println(" Preferred Name: " + mapEv.getPreferredName()); System.out.println(" Matched Words: " + mapEv.getMatchedWords()); System.out.println(" Semantic Types: " + mapEv.getSemanticTypes()); System.out.println(" MatchMap: " + mapEv.getMatchMap()); System.out.println(" MatchMap alt. repr.: " + mapEv.getMatchMapList()); System.out.println(" is Head?: " + mapEv.isHead()); System.out.println(" is Overmatch?: " + mapEv.isOvermatch()); System.out.println(" Sources: " + mapEv.getSources()); System.out.println(" Positional Info: " + mapEv.getPositionalInfo()); } } } }