|
|
|
|
LEADER |
05172nam a22004093i 4500 |
001 |
EBC5788423 |
003 |
MiAaPQ |
005 |
20231204023214.0 |
006 |
m o d | |
007 |
cr cnu|||||||| |
008 |
231204s2019 xx o ||||0 eng d |
020 |
|
|
|a 9783030138455
|q (electronic bk.)
|
020 |
|
|
|z 9783030138448
|
035 |
|
|
|a (MiAaPQ)EBC5788423
|
035 |
|
|
|a (Au-PeEL)EBL5788423
|
035 |
|
|
|a (OCoLC)1106161463
|
040 |
|
|
|a MiAaPQ
|b eng
|e rda
|e pn
|c MiAaPQ
|d MiAaPQ
|
050 |
|
4 |
|a RC261-271
|
100 |
1 |
|
|a Couto, Francisco M.
|
245 |
1 |
0 |
|a Data and Text Processing for Health and Life Sciences.
|
250 |
|
|
|a 1st ed.
|
264 |
|
1 |
|a Cham :
|b Springer International Publishing AG,
|c 2019.
|
264 |
|
4 |
|c Ã2019.
|
300 |
|
|
|a 1 online resource (107 pages)
|
336 |
|
|
|a text
|b txt
|2 rdacontent
|
337 |
|
|
|a computer
|b c
|2 rdamedia
|
338 |
|
|
|a online resource
|b cr
|2 rdacarrier
|
490 |
1 |
|
|a Advances in Experimental Medicine and Biology Series ;
|v v.1137
|
505 |
0 |
|
|a Intro -- Preface -- Acknowledgments -- Contents -- Acronyms -- 1 Introduction -- Biomedical Data Repositories -- Scientific Text -- Amount of Text -- Ambiguity and Contextualization -- Biomedical Ontologies -- Programming Skills -- Why This Book? -- Third-Party Solutions -- Simple Pipelines -- How This Book Helps Health and Life Specialists? -- Shell Scripting -- Text Files -- Relational Databases -- What Is in the Book? -- Command Line Tools -- Pipelines -- Regular Expressions -- Semantics -- 2 Resources -- Biomedical Text -- What? -- Where? -- How? -- Semantics -- What? -- Languages -- Formality -- Gold Related Documents -- Where? -- OBO Ontologies -- Popular Controlled Vocabularies -- How? -- OWL -- URI -- Further Reading -- 3 Data Retrieval -- Caffeine Example -- Unix Shell -- Current Directory -- Windows Directories -- Change Directory -- Useful Key Combinations -- Shell Version -- Data File -- File Contents -- Reverse File Contents -- My First Script -- Line Breaks -- Redirection Operator -- Installing Tools -- Permissions -- Debug -- Save Output -- Web Identifiers -- Single and Double Quotes -- Comments -- Data Retrieval -- Standard Error Output -- Data Extraction -- Single and Multiple Patterns -- Data Elements Selection -- Task Repetition -- Assembly Line -- File Header -- Variable -- XML Processing -- Human Proteins -- PubMed Identifiers -- PubMed Identifiers Extraction -- Duplicate Removal -- Complex Elements -- XPath -- Namespace Problems -- Only Local Names -- Queries -- Extracting XPath Results -- Text Retrieval -- Publication URL -- Title and Abstract -- Disease Recognition -- Further Reading -- 4 Text Processing -- Pattern Matching -- Case Insensitive Matching -- Number of Matches -- Invert Match -- File Differences -- Evaluation Metrics -- Word Matching -- Regular Expressions -- Extended Syntax -- Alternation -- Basic Syntax.
|
505 |
8 |
|
|a Scope -- Multiple Alternatives -- Multiple Characters -- Spaces -- Groups -- Ranges -- Negation -- Quantifiers -- Optional -- Multiple and Optional -- Multiple and Compulsory -- All Options -- Position -- Beginning -- Ending -- Near the End -- Word in Between -- Full Line -- Match Position -- Tokenization -- Character Delimiters -- Wrong Tokens -- String Replacement -- Multi-character Delimiters -- Keep Delimiters -- Sentences File -- Entity Recognition -- Select the Sentence -- Pattern File -- Relation Extraction -- Multiple Filters -- Relation Type -- Remove Relation Types -- Further Reading -- 5 Semantic Processing -- Classes -- OWL Files -- Class Label -- Class Definition -- Related Classes -- URIs and Labels -- URI of a Label -- Label of a URI -- Synonyms -- URI of Synonyms -- Parent Classes -- Labels of Parents -- Related Classes -- Labels of Related Classes -- Ancestors -- Grandparents -- Root Class -- Recursion -- Iteration -- My Lexicon -- Ancestors Labels -- Merging Labels -- Ancestors Matched -- Generic Lexicon -- All Labels -- Problematic Entries -- Special Characters Frequency -- Completeness -- Removing Special Characters -- Removing Extra Terms -- Removing Extra Spaces -- Disease Recognition -- Performance -- Inverted Recognition -- Case Insensitive -- ASCII Encoding -- Correct Matches -- Incorrect Matches -- Entity Linking -- Modified Labels -- Ambiguity -- Surrounding Entities -- Semantic Similarity -- Measures -- DiShIn Installation -- Database File -- DiShIn Execution -- Large Lexicons -- MER Installation -- Lexicon Files -- MER Execution -- Further Reading -- Bibliography -- Index.
|
588 |
|
|
|a Description based on publisher supplied metadata and other sources.
|
590 |
|
|
|a Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2023. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries.
|
655 |
|
4 |
|a Electronic books.
|
776 |
0 |
8 |
|i Print version:
|a Couto, Francisco M.
|t Data and Text Processing for Health and Life Sciences
|d Cham : Springer International Publishing AG,c2019
|z 9783030138448
|
797 |
2 |
|
|a ProQuest (Firm)
|
830 |
|
0 |
|a Advances in Experimental Medicine and Biology Series
|
856 |
4 |
0 |
|u https://ebookcentral.proquest.com/lib/matrademy/detail.action?docID=5788423
|z Click to View
|