This resource provides tools for subsetting and evaluating datasets that have not originally been created for research. Thanks to this resource, researchers will be able to robustly explore large datasets, examine their representativeness, and extract the subset they are interested in.