What's the best way to monitor an API for breaking changes?

Clay_pidgin@sh.itjust.works · 4 months ago

What's the best way to monitor an API for breaking changes?

yaroto98@lemmy.world · 4 months ago

Do they use openapi or swagger or something? If so you should be able to do something like use changedetect.io on their swaggerdocs page.

Clay_pidgin@sh.itjust.works · 4 months ago

They generate a swaggger file for me on request with a lag time of weeks usually, but for only one of the APIs. The others are documented in emails basically. This is a B2B type of thing, they are not publicly available APIs.

nomad@infosec.pub · 4 months ago

Ask them to generate a schema file that you can download from the api. Or at least an endpoint that returns a hash of the current api schema file. That’s cheap versioning telling you if something changes.

You can always use the swagger schema to verify the api. So ask some basic questions what should always be true and put that into validation scripts. If they use a framework, HEAD requests usually tell you some things.

Last really bad vendor had an openapi page that listed the endpoints but the api wouldn’t adhere to the details given there. I discovered that their website used the api all the time and surfing that i was able to discover which parameters were required etc.

Last idea is statistics. Grab any count data you can get, like from pagination data and create a baseline of available data over time. That gives you an expected count and you can detect significant divergences.

I tend to show up at the vendors it guys in person and bribe them into helping me behind their bosses backs. Chocolate, coffee and some banter can do wonders.

Clay_pidgin@sh.itjust.works · edit-2 4 months ago

I’m 3,500 miles from the vendor’s devs, sadly.

Asking them to put the swagger file itself behind the API is a good idea. Their dev backlog is 3-24 months.

I used the same trick to determine the required headers and parameters - I checked their website which uses the same API.

The source of their delays is that different devs or teams “own” different endpoints and make their changes without documenting. It’s annoying, stuff like the same data being in field “hostId” on one endpoint but “deviceId” on another.

CrypticCoffee@lemmy.ml · 4 months ago

This is why you have requirements which are agreed upon and affect payment if not upheld. If you start being firmer, they might move quicker. 24 month lead team is bullshit.

Clay_pidgin@sh.itjust.works · 4 months ago

They have accepted the penalties as the cost of doing business, and the decision makers on my side are worried about opening it up again. It’s a custom hardware + custom software thing so there aren’t that many options!

nomad@infosec.pub · 4 months ago

Just build a few selenium Tests to ensure the API requests the website performs don’t change without you noticing :)

Clay_pidgin@sh.itjust.works · 4 months ago

That’s not a bad idea. Usually, so far, their frontend team doesn’t hear about the changes either!

nomad@infosec.pub · 4 months ago

Wow that’s bad practice. Sell your monitoring to them to help improve their quality.

Clay_pidgin@sh.itjust.works · 4 months ago

I honestly think we provide a significant impetus for improvement on their side. They have lots of other customers, but most aren’t as involved and embedded in the data as we are.

yaroto98@lemmy.world · 4 months ago

Are any of their apis a GET that returns lists? I create a lot of automated api tests. You might be able to GET a list of users (or whatever) then pick a random 10 user_ids and query another api, say user_addresses and pass in each id one at a time and verify a proper result. You don’t have to verify the data itself, just that the values you care about are not empty and they key exists.

You can dynamically test a lot this way and if a key gets changed from ‘street’ to ‘street_address’ your failing tests should let you know.

Clay_pidgin@sh.itjust.works · 4 months ago

Unfortunately on the main API I use of theirs, there’s an endpoint with a list of objects and their IDs, and those IDs are used everywhere else. The rest of the endpoints aren’t connected. I can’t walk e.g. school > students > student > grades or something

yaroto98@lemmy.world · 4 months ago

I made my career out of automated testing with a focus on apis. I’m not aware of any easy tool to do what you want. The easiest way to quick whip up basic api tests that I’ve found is python/pytest with requests. You can parameterize lots of inputs, run tests in parallel, easily add new endpoints as you go, benchmark the apis for response times, etc. It’ll take a lot of work in the beginning, then save you a lot of work in the end.

Now, AI will be able to make the process go faster. If you give it a sample input and output it can do 95% of a pytest in 10s. But beware that last 5%.

jjjalljs@ttrpg.network · 4 months ago

Yeah I would use python and pytest, probably.

You need to decide what you expect to be a passing case. Known keys are all there? All values in acceptable range? Do you have anything where you know exactly what the response should be?

How many endpoints are there?