Set Size Supplement

Estimating the Size of a Set Using Cascading Exclusion - Supplementary Materials

Author

Susan Holmes

Overview

This repository contains all supplementary materials, code, and analyses for the paper “Estimating the Size of a Set Using Cascading Exclusion.” by Sourav Chatterjee, Persi Diaconis and Susan Holmes, August 2025.

Data

  • silva_nr99_v138.2_toGenus_trainset.fa: Original SILVA database version
  • silva_aligned_bacteria_sequences.fasta: Aligned bacterial DNA sequences
  • silva_sequence_info.csv: Sequence metadata and taxonomic information

Reproducibility

Clone this repository and run render or build.