or, "i've got xxxxxxx thousand images in my inbox, what the fuck do i do?"
i'm currently having this problem myself, so i thought it would be nice to have a thread to collect information about various ways to automate sorting/tagging/etc, such as:
* image deduplication scripts:
>https://github.com/knjcode/imgdupes
i'm presently also writing my own as well which should work somewhat better for large datasets, i'll post it here when it's usable
* datasets, useful for running against aforementioned dedupe scripts
>danbooru2019, contains all danbooru pictures + metadata up to early 2019: >https://www.gwern.net/Danbooru2019
i remember there being large dumps of other booru metadata on here years ago, but i can't find them anymore
* AI/neural network software for automated tagging, classifiers and etc:
https://github.com/KichangKim/DeepDanbooru
https://github.com/imamar94/ramrem-classifier
things i couldn't find but would find extremely useful:
>AI anime/photograph classifier
this would be very helpful, especially with deduplicating from danbooru2019
>subject classifier
if there was a NN that could tell me if the subject of a picture was a person or something else, this would be insanely useful, especially for the next item
>SFW/NSFW classifier
about half of my collection is porn. i'm planning on deleting basically all of it, so just being able to get that out of the way quickly would cut my workload in half