Interactive data visualizations from the November 2025 House Oversight Committee document release
UMAP dimensionality reduction of all 69,290 document embeddings. Shows semantic similarity between documents - similar topics cluster together.
Colors: Red = IMAGES, Teal = TEXT
View VisualizationCo-occurrence network of names in documents. Node size indicates mention frequency. Edges show which names appear together in the same documents.
Hover over nodes to see mention counts.
View VisualizationBreakdown of documents by type (IMAGES vs TEXT) and volume. Shows the composition of the 69,290 document chunks.
TEXT-001 = 44.7% of all chunks (legal documents)
View Visualization