One thing that makes StreetComplete a bit of a chore for me is entering opening times: depending on where you are, you might have to specify them differently for every single day, which means a lot of typing. That is more time spent staring at my phone with my hands in an unergonomic position, leading to soreness in my neck, hands, etc.
So I was thinking: Surely we are technically able to extract those data from an image?
If I had time to implement it (which unfortunately I don't think I will), I'd just try to find a prompt that works well, send it along with the photo to an LLM like Mistral (because it is in the EU), get the info back, have the user check it, and then enter it into OpenStreetMap. That would only require me to provide my API token, which I would be willing to pay for (assuming my estimate that this costs in the range of cents is right).
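To make the idea a bit more concrete, here is a rough sketch of the part that would sit between the LLM and the user check. It assumes the model is asked to reply with JSON keyed by weekday; the prompt wording, the JSON shape, and the function names `parse_reply` and `to_opening_hours` are all made up for illustration and are not part of StreetComplete or any existing tool. The actual API call is left out on purpose, since it depends on the provider. Closed days are simply omitted from the result, which is one possible convention.

```python
import json
import re

DAYS = ["Mo", "Tu", "We", "Th", "Fr", "Sa", "Su"]

# Hypothetical prompt; the wording is an assumption and would need tuning.
PROMPT = (
    "Read the opening hours on this sign and reply with JSON only. "
    'Use the keys "Mo","Tu","We","Th","Fr","Sa","Su". Each value is a list '
    'of "HH:MM-HH:MM" strings, or an empty list if the place is closed.'
)

# Shape check only: two digits, colon, two digits on each side of the dash.
SPAN = re.compile(r"\d{2}:\d{2}-\d{2}:\d{2}")


def parse_reply(reply: str) -> dict[str, list[str]]:
    """Parse the model's JSON reply and reject anything unexpected."""
    data = json.loads(reply)  # raises ValueError on invalid JSON
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object keyed by weekday")
    hours = {}
    for day in DAYS:
        spans = data.get(day, [])
        if not isinstance(spans, list):
            raise ValueError(f"{day}: expected a list of time spans")
        for span in spans:
            if not isinstance(span, str) or not SPAN.fullmatch(span):
                raise ValueError(f"{day}: unexpected time span {span!r}")
        hours[day] = spans
    return hours


def to_opening_hours(hours: dict[str, list[str]]) -> str:
    """Merge consecutive days with identical hours into one rule."""
    rules = []
    i = 0
    while i < len(DAYS):
        spans = hours.get(DAYS[i], [])
        j = i
        # Extend the run while the next day has exactly the same spans.
        while j + 1 < len(DAYS) and hours.get(DAYS[j + 1], []) == spans:
            j += 1
        if spans:  # closed days are left out of the tag value
            days = DAYS[i] if i == j else f"{DAYS[i]}-{DAYS[j]}"
            rules.append(f"{days} {','.join(spans)}")
        i = j + 1
    return "; ".join(rules)


if __name__ == "__main__":
    # Stand-in for what the model might return for a photographed sign.
    reply = json.dumps({
        "Mo": ["09:00-18:00"], "Tu": ["09:00-18:00"],
        "We": ["09:00-12:00", "14:00-18:00"],
        "Th": ["09:00-18:00"], "Fr": ["09:00-18:00"],
        "Sa": ["10:00-14:00"], "Su": [],
    })
    # The resulting string is what the user would confirm before upload.
    print(to_opening_hours(parse_reply(reply)))
```

The string that comes out is meant for the `opening_hours` tag, and the user would still look at it next to the photo before anything is uploaded. Rejecting malformed replies outright, rather than trying to repair them, keeps the model from silently inventing times.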
Now:
- Does something like that exist and I just don’t know about it?
- If somebody is willing to try that out I can provide some example data, collect more and test it.
@DonnerWolfBach There were some experiments by @ian and @zverik with extracting OSM tags from pictures using gpt-4o-mini in Every Door: https://en.osm.town/@everydoor/113192204761150958
The model is pretty out of date these days, so I imagine there’s room for improvement!
#OpenStreetMap
@ame @DonnerWolfBach @ian @zverik Maybe this one? https://image-to-osm.vercel.app