cross-posted from: https://lemmy.ml/post/44059967
For those not familiar with Mark Pilgrim, he is/was a prolific author, blogger, and hacker who abruptly disappeared from the internet in 2011.
cross-posted from: https://lemmy.bestiver.se/post/968527
So no one is going to say what chardet is, huh.
It’s a Python library for guessing which character encoding a sequence of bytes was encoded with.
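To make "detecting an encoding" concrete, here is a deliberately naive sketch using only the standard library. It is not how chardet works internally (chardet uses statistical models); it just tries a hypothetical shortlist of candidate encodings in order and reports the first one that decodes without error. The `guess_encoding` name and the `CANDIDATES` list are made up for illustration.

```python
# Naive illustration only: try candidate encodings in order and return
# the first one that decodes the bytes without raising an error.
# CANDIDATES is a hypothetical shortlist, not chardet's list.
CANDIDATES = ["ascii", "utf-8", "cp1252"]


def guess_encoding(data: bytes, candidates=CANDIDATES):
    """Return the first candidate encoding that decodes `data`, else None."""
    for name in candidates:
        try:
            data.decode(name)
        except UnicodeDecodeError:
            continue  # this encoding cannot represent these bytes
        return name
    return None


if __name__ == "__main__":
    print(guess_encoding(b"plain ascii"))                      # ascii
    print(guess_encoding("It’s a lovely day".encode("utf-8")))  # utf-8
    print(guess_encoding("café".encode("cp1252")))              # cp1252
```

The order of the shortlist matters: many byte sequences decode cleanly under several encodings, which is why a real detector has to weigh likelihoods and report a confidence rather than just take the first success. With the real library, the usual entry point is `chardet.detect(data)`, which takes bytes and returns a dict that includes `encoding` and `confidence` keys.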
Here are the docs for the vibe-coded rewrite, and here is the version before it.
The new vibe-coded version also adds language detection. It isn’t clear to me why the current version of the readme shows it classifying the string
"It’s a lovely day — let’s grab coffee." as Spanish with 99% confidence, with no comment in the docs about that being a misclassification, but I guess that if the LLM-authored program says so, then that must be one of those phrases that looks the same in Spanish as in English 👀