The glossary has been updated with the following terms: Big Data • Compliance • Concordance • Mash-Up • Natural Language Processing • Point-of-Need.
The glossary has been updated with the following terms: Disposition • Caption • Case-Folding • Truecasing • Normalization • Stemming • Lemmatization • Heuristic • Porter’s Algorithm • Phrase Queries • Phrase Index • Biword Index • Positional Index.
The glossary has been updated with the following terms: Precision • Recall • Inverted Index • Inverted File • Dictionary • Vocabulary • Lexicon • Syllabus • Headnote • Decision • Opinion • Dissenting Opinion • Concurring Opinion.
As part of my own self-study to expand my knowledge of issues related to the use of legal data, I started to create a glossary of common terminology to help me absorb concepts and organize my thoughts. Since the glossary would likely include terminology from various related fields (computer science, web development, law, legal publishing, statistics, data visualization, etc.). I thought I would post it here in the hope that someone else may find it useful (or even better offer suggestions and feedback, hint, hint). My intent is to build the glossary incrementally as I encounter terms during my job, while blogging, or while reading outside materials. The first terms added will likely be basic IR terms because I am currently reading Introduction to Information Retrieval by Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze (Cambridge University Press, 2009).
I will post here as new terms are added.
UPDATE: The glossary has been posted and can be accessed through the header navigation bar. The glossary includes the following terms: Boolean Logic • Corpus • Document Unit • Grepping • Indexing • Information Need • Information Retrieval • Relevance • Stop List • Stop Word • Tokenization