API |
Application Programming Interface: a facility offered by a web resource which allows search queries independent of a GUI, often performed using scripts |
bash |
default program that runs in the command line |
bias |
systematic error that results from an unbalanced sample |
big data |
huge amount of data, identifiable through repeated freezing of your standard program when opening a file |
born digital data |
data which originated in a digital form |
CLI |
Command Line Interface, text interface that allows interaction with the computer; see also bash |
close reading |
careful and attentive interpretation of a text |
CMS |
Content Management System |
Console |
See CLI |
Crowdsourcing |
projects that include the active participation of the public to generate content, transcribe sources etc. |
csv |
comma separated values, a structured text format, using commas as separators between columns |
distant reading |
quantitative approach to huge amounts of texts, using computational methods to search for interpretable patterns |
GUI |
Graphical User Interface |
HTML |
Hypertext Markup Language, a structured text format, like the format this guide is written in, to render documents in a browser |
Jupyter notebook |
web application/interactive coding environment that runs in a browser; let’s you create and share code (https://jupyter.org) |
machine learning |
umbrella term for different methods that use data to do a task in a specific way, using data to learn and to improve the results |
machine readable |
transformation of, for example, text into a data format that is processable by a computer |
OCR |
Optical Character Recognition, process of transforming text on an image into a data format |
OS |
Operating System |
open source |
freely available source code that can be used, modified and redistributed without limitations |
OSS |
Open Source Software |
Regular Expression |
syntax for search and replace text using patterns (instead of exact matches) |
terminal |
See CLI |
web scraping |
extracting data from websites |