
Race condition between processing and scraping #144

Open
@leklund

Description


There is a race condition between scraping and incrementing the per-datacenter metrics. When a response from the real-time stats API is processed, the exporter iterates over the datacenters and increments each metric in turn. If a scrape arrives in the middle of that loop, it reports metrics that include the latest second of realtime data for some datacenters but not for others, because the response hasn't finished processing yet. I was able to reproduce this easily by adding an artificial delay to the processing loop, forcing the scrape to land mid-loop. This can cause interesting graphs when running queries like:

sum(rate(fastly_rt_requests_total[1m])) by (service_id)
  - sum(rate(fastly_rt_tls_total[1m])) by (service_id)

This line should be flat:

[Screenshot, 2023-07-06: graph of the query above, which is not flat]

A potential solution is to add locking so that every scrape is guaranteed to observe a complete set of data from any given API response. This has performance implications, especially when running against many services.

Thanks to @mrnetops for reporting.
