Subsetting tool to evaluate biases and errors
This resource provides tools for subsetting and evaluating datasets that have not originally been created for research. Thanks to this resource, researchers will be able to robustly explore large datasets, examine their representativeness, and extract the subset they are interested in.
End-user interface links and usage instructions for centrally indexed datasets: https://github.com/hsci-r/elasticsearch-openshift/blob/main/documentation/exported_query.md
Technical documentation enabling people to set up their own instances for their own datasets: https://github.com/hsci-r/elasticsearch-openshift
Resource developed by the University of Helsinki in partnership with CSC – the IT Center for Science.
Contact about this resource: Eetu Mäkelä