oh hey just as a PSA, any code any of y'all may have on github might have been scraped by @swheritage, a self-proclaimed "preservation" org which
- just hoovered up vast amounts of data without asking or telling anyone
- insists on deadnaming trans people forever for "integrity" reasons
- used it to build an LLM training data set
https://huggingface.co/spaces/bigcode/in-the-stack to check and for opt-out instructions