Anyone interested in govt transparency and public access should check out GovScape from @bcgl.bsky.social and his teamπ
It's an incredibly powerful tool that allows visual, semantic text, and keywords search of 10 million U.S. government PDFs (70 million pages!) and counting: www.govscape.net
19.11.2025 18:24 β π 51 π 26 π¬ 1 π 1
New Research Tool: GovScape (US Gov PDFs)
govscape.net ||| Research Paper (preprint) About #GovScape arxiv.org/abs/2511.11010 #govdocs @eotarchive.org
19.11.2025 14:20 β π 7 π 6 π¬ 0 π 0
Weβre live! Search 10 million+ U.S. government PDFs (70 million pages)! GovScape offers visual search, semantic text search, and keyword search. Explore below:
Website: govscape.net
ArXiv link: arxiv.org/abs/2511.11010
18.11.2025 21:16 β π 17 π 5 π¬ 0 π 1
Huge step forward in enabling access and use of content archived from government websites!
18.11.2025 20:27 β π 11 π 3 π¬ 0 π 0
GovScape - Search 10+ Million Government PDFs
My favorite use of govscape.net is to search "[INSERT COMPANY] lawsuit". See: the time that Skippy sued customs to demand that their fake peanut butter be recognized as "peanut butter or peanut slurry".
govscape.net/preview/KBVJ...
@bcgl.bsky.social @govscape.bsky.social
19.11.2025 16:27 β π 3 π 0 π¬ 0 π 0
Link rot on the government web is actually one of the things weβre hoping to look into with this dataset! But, if you just want the PDF, you can use the βdownload PDFβ button to grab it from the archive instead
19.11.2025 12:42 β π 5 π 0 π¬ 1 π 0
1/ Announcing GovScape β a public search system for 10 million U.S. government PDFs (70 million pages)! GovScape offers visual search, semantic text search, and keyword search. Explore below:
Website: www.govscape.net
ArXiv link: arxiv.org/abs/2511.11010
18.11.2025 20:19 β π 79 π 35 π¬ 3 π 4
Washed-up D2 runner, information scientist, U-M research faculty, criminology data archive director, and girl (2x) dad. Thoughts are my own.
Music librarian, historian, digital humanist. SUCHO co-founder. First-gen Polish-American immigrant. Pronouns: she series
Online: annakijas.com / sucho.org
π Internet Archive.org & @EOTArchive.org & FDLP ποΈ We build tools so the past and future can meet.
Public Access to Public Data is a Public Good. We want to ensure our data are not gone forever. Read more about our efforts: https://www.datarescueproject.org/press/
Playing with data, visualizing it for humans at flowingdata.com
Assistant Professor @ University of Washington, Information School. PhD in English from WashU in St. Louis.
Iβm interested in books, data, social media, & digital humanities.
They call me "Eyre Jordan" on the bball court. π
https://melaniewalsh.org/
Associate dean for academic affairs and Associate professor at Chicago-Kent College of Law (Illinois Tech). I study financial markets regulation & law of capitalism. But this β IIT. DSA Fund board πΉπ€, Rstats, NLP, Phish dad, etc. Semper ubi sub ubi.
Archivist/librarian. Former editor. Occasional bookbinder. Just a guy who loves books. He/him π³οΈβπ
"A library is a focal point, a sacred place to a community; and its sacredness is its accessibility, its publicness. Itβs everybodyβs place." βUrsula K Le Guin
Associate professor of computer science at Northeastern University. Natural language processing, digital humanities, OCR, computational bibliography, and computational social sciences. Artificial intelligence is an archival science.
Associate Professor of Political Science, University of Illinois Chicago. Author of The Thinkers: The Rise of Partisan Think Tanks and the Polarization of American Politics. Also, baseball.
πΈcocktails π policy π data
πMorgantown, West Virginia, Appalachia
π https://www.linkedin.com/in/samuel-workman
π https://samuelworkman.owlstown.net
π https://open.substack.com/pub/samuelworkman
Associate University Librarian for Digital Libraries at the University of North Texas
Open Data Consultant for @eleutherai.bsky.social & Digital History Advisor for @eui-history.bsky.social. Website: https://www.storytracer.com/
asst prof of computer science at cu boulder
nlp, cultural analytics, narratives, communities
books, bikes, games, art
https://maria-antoniak.github.io
New England's leading source for breaking news and analysis.
bostonglobe.com
A public search system for 10+ million government PDFs available at: https://govscape.net. Follow for updates, feature announcements, and more. Maintained by @bcgl.bsky.social
Assistant Professor @ the University of Washington iSchool | formerly an Innovator in Residence @ Library of Congress | essays in WIRED, Gawker, The New Republic, Longreads, Current Affairs, etc.
π www.bcglee.com
Software Dev Engineer @ King County Library System
Prior Research Assistant @ UW eScience Institute
Prior Linked Data Metadata Specialist @ UW Libraries
CEO of Bluesky, steward of AT Protocol.
dec/acc π± πͺ΄ π³