Anhang A — Glossar

Glossary

  • API: Application Programming Interface: a facility offered by a web resource which allows search queries independent of a GUI, often performed using scripts
  • bash: default program that runs in the command line
  • big data: huge amount of data, identifiable through repeated freezing of your standard program when opening a file
  • born digital data: data which originated in a digital form
  • CLI: Command Line Interface, text interface that allows interaction with the computer; see also bash
  • CMS: Content Management System
  • Console: See CLI
  • Crowdsourcing: projects that include the active participation of the public to generate content, transcribe sources etc.
  • csv: comma separated values, a structured text format, using commas as separators between columns
  • distant reading: quantitative approach to huge amounts of texts, using computational methods to search for interpretable patterns
  • GUI: Graphical User Interface
  • HTML: Hypertext Markup Language, a structured text format, like the format this guide is written in, to render documents in a browser
  • Jupyter notebook: web application/interactive coding environment that runs in a browser; let’s you create and share code (https://jupyter.org)
  • machine readable: transformation of, for example, text into a data format that is processable by a computer
  • OCR: Optical Character Recognition, process of transforming text on an image into a data format
  • OS: Operating System
  • OSS: Open Source Software
  • Regular Expression: syntax for search and replace text using patterns (instead of exact matches)
  • terminal: See CLI