Synonym extraction and abbreviation expansion with ensembles of semantic spaces, Journal of Biomedical Semantics
Last updated 25 May 2024
Background Terminologies that account for variation in language use by linking synonyms and abbreviations to their corresponding concept are important enablers of high-quality information extraction from medical texts. Due to the use of specialized sub-languages in the medical domain, manual construction of semantic resources that accurately reflect language use is both costly and challenging, often resulting in low coverage. Although models of distributional semantics applied to large corpora provide a potential means of supporting development of such resources, their ability to isolate synonymy from other semantic relations is limited. Their application in the clinical domain has also only recently begun to be explored. Combining distributional models and applying them to different types of corpora may lead to enhanced performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs.
Results A combination of two distributional models – Random Indexing and Random Permutation – employed in conjunction with a single corpus outperforms using either of the models in isolation. Furthermore, combining semantic spaces induced from different types of corpora – a corpus of clinical text and a corpus of medical journal articles – further improves results, outperforming a combination of semantic spaces induced from a single source, as well as a single semantic space induced from the conjoint corpus. A combination strategy that simply sums the cosine similarity scores of candidate terms is generally the most profitable out of the ones explored. Finally, applying simple post-processing filtering rules yields substantial performance gains on the tasks of extracting abbreviation-expansion pairs, but not synonyms. The best results, measured as recall in a list of ten candidate terms, for the three tasks are: 0.39 for abbreviations to long forms, 0.33 for long forms to abbreviations, and 0.47 for synonyms.
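The combination strategy that sums cosine similarity scores, and the evaluation by recall in a list of ten candidate terms, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (rank_by_summed_cosine, recall_at_k), the toy vectors, and the choice to let a term that is missing from a space contribute nothing to its total are all assumptions made for illustration. Each semantic space is represented here simply as a mapping from a term to its context vector, and the spaces may have different dimensionalities because similarity is computed within each space separately.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_by_summed_cosine(query, spaces, k=10):
    """Rank candidate terms by the sum of their cosine similarity to `query`
    across several semantic spaces (each a dict: term -> vector).

    Assumption: a space that lacks the query term is skipped, and a candidate
    absent from a space simply receives no score from that space.
    """
    totals = defaultdict(float)
    for space in spaces:
        if query not in space:
            continue
        query_vec = space[query]
        for term, vec in space.items():
            if term != query:
                totals[term] += cosine(query_vec, vec)
    # Sort by descending summed score; break ties alphabetically.
    ranked = sorted(totals, key=lambda t: (-totals[t], t))
    return ranked[:k]


def recall_at_k(candidates, gold):
    """Fraction of the gold-standard terms found in the candidate list."""
    gold = set(gold)
    if not gold:
        return 0.0
    return len(set(candidates) & gold) / len(gold)


# Hypothetical toy spaces standing in for, e.g., a clinical-text space and a
# journal-article space; the vectors are invented for this example.
clinical_space = {
    "mi": [1.0, 0.0],
    "myocardial infarction": [0.9, 0.1],
    "fracture": [0.0, 1.0],
}
journal_space = {
    "mi": [0.0, 1.0, 0.0],
    "myocardial infarction": [0.1, 0.9, 0.0],
    "fracture": [1.0, 0.0, 0.0],
}

top_terms = rank_by_summed_cosine("mi", [clinical_space, journal_space], k=10)
print(top_terms)
print(recall_at_k(top_terms, {"myocardial infarction"}))
```

Summing the scores means a candidate that both spaces rank highly rises to the top, while a candidate favored by only one space is pushed down, which is one plausible reading of why a simple sum can help an ensemble.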
Conclusions This study demonstrates that ensembles of semantic spaces can yield improved performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs. This notion, which merits further exploration, allows different distributional models – with different model parameters – and different types of corpora to be combined, potentially allowing enhanced performance to be obtained on a wide range of natural language processing tasks.