INCA: Infrastructure for content analysis

Open Access
Authors
Publication date 2018
Book title IEEE 14th International Conference on eScience
Book subtitle proceedings : 29 October-1 November 2018, Amsterdam, the Netherlands
ISBN
  • 9781538691571
ISBN (electronic)
  • 9781538691564
Event 14th eScience international IEEE conference
Pages (from-to) 329-330
Publisher Los Alamitos, California: IEEE Computer Society
Organisations
  • Faculty of Social and Behavioural Sciences (FMG) - Amsterdam School of Communication Research (ASCoR)
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We present INCA (short for INfrastructure for Content Analysis), a Python module for collecting, storing, processing, and analyzing a wide variety of media content, including but not limited to news, political debates, social media, forums, and customer reviews. Using Elasticsearch as a database backend and Celery for task management, it makes automated content analysis scalable. INCA's main objective is to enable and promote an integrated workflow. INCA focuses on re-usability of data, processors, and analyses, making all steps of automated content analysis (ACA) accessible to social scientists without requiring advanced programming skills. Here, we present the aim, implementation, and recommended workflow for INCA.
Document type Conference contribution
Language English
Published at https://doi.org/10.1109/eScience.2018.00078
Downloads
08588697 (Final published version)