Subsetting tool to evaluate biases and errors

This resource provides tools for subsetting and evaluating datasets that have not originally been created for research. Thanks to this resource, researchers will be able to robustly explore large datasets, examine their representativeness, and extract the subset they are interested in.

End-user interface links and usage instructions for centrally indexed datasets: https://github.com/hsci-r/elasticsearch-openshift/blob/main/documentation/exported_query.md
Technical documentation enabling people to set up their own instances for their own datasets: https://github.com/hsci-r/elasticsearch-openshift

Resource developed by the University of Helsinki in partnership with CSC – the IT Center for Science.

Contact about this resource: Eetu Mäkelä

DARIAH-FI OFFICE:

The National Archives of Finland

Tanja Välisalo

DARIAH-FI: YLEISET KYSYMYKSET

DARIAH-FI: GENERAL

DARIAH-KONTTORI:

Turun yliopisto

Veronika Laippala

DARIAH-KONTTORI:

Jyväskylän yliopisto

Tanja Välisalo

DARIAH-KONTTORI:

Itä-Suomen yliopisto

Paula Rautionaho

DARIAH-KONTTORI:

Oulun yliopisto

Marika Rauhala

DARIAH-KONTTORI:

Aalto-yliopisto

Eero Hyvönen

DARIAH-KONTTORI:

Helsingin yliopisto

Risto Turunen

DARIAH-KONTTORI:

TampereEN YLIOPISTO

Sanna Kumpulainen

DARIAH-KONTTORI:

Suomen Kansalliskirjasto

Johanna Lilja

DARIAH-KONTTORI:

CSC – Tieteen tietotekniikan keskus

Katri Tegel

DARIAH-FI OFFICE:

CSC – IT Centre for Science

Katri Tegel

DARIAH-FI OFFICE:

National Library of Finland

Johanna Lilja

DARIAH-FI OFFICE:

Tampere University

Sanna Kumpulainen

DARIAH-FI OFFICE:

Aalto University

Eero Hyvönen

DARIAH-FI OFFICE:

University of Oulu

Marika Rauhala

DARIAH-FI OFFICE:

University of Eastern Finland

Paula Rautionaho

DARIAH-FI OFFICE:

Jyväskylä University

Venla Poso

DARIAH-FI OFFICE:

University of Turku

Veronika Laippala