@Andrew-Dickinson
Member

The Datadog logs are filled with errors from failed Slack notification deliveries :(

[2025-06-29 12:20:12,522: ERROR/ForkPoolWorker-5] Got HTTP 404 while sending slack notification to slack admin. HTTP response was no_service. Unable to notify admins of the following message:

I believe this is a result of rate limiting. This PR adds retries with exponential backoff to the notification delivery system.
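
For readers unfamiliar with the pattern, here is a minimal sketch of retrying with exponential backoff. It is not the code from this PR: `send_with_backoff`, its parameters, and the "HTTP 200 means success" rule are all assumptions made for illustration, and the actual request is passed in as a callable so the example stays self-contained.

```python
import time
from typing import Callable


def send_with_backoff(
    send: Callable[[], int],
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Call send() until it reports HTTP 200 or the attempts run out.

    `send` is a hypothetical callable that performs one delivery attempt
    and returns the HTTP status code. Returns True on success.
    """
    for attempt in range(max_attempts):
        status = send()
        if status == 200:
            return True
        # Skip the wait after the final failed attempt
        if attempt < max_attempts - 1:
            # Delay doubles each time: base, 2*base, 4*base, ...
            sleep(base_delay * 2**attempt)
    return False
```

With the defaults above, a message that never succeeds is attempted four times and the caller is blocked for roughly 1 + 2 + 4 = 7 seconds of waiting in total, which is worth keeping in mind wherever this runs synchronously.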

@Andrew-Dickinson Andrew-Dickinson force-pushed the andrew/admin-notification-retries branch from 453e273 to ba0186a Compare June 29, 2025 17:14
@Andrew-Dickinson Andrew-Dickinson force-pushed the andrew/admin-notification-retries branch from ba0186a to db1b82d Compare June 29, 2025 17:15
@codecov

codecov bot commented Jun 29, 2025

Codecov Report

Attention: Patch coverage is 92.30769% with 1 line in your changes missing coverage. Please review.

Project coverage is 95.06%. Comparing base (40dd5ec) to head (db1b82d).

Files with missing lines Patch % Lines
src/meshapi/util/admin_notifications.py 92.30% 1 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff           @@
##             main     #958   +/-   ##
=======================================
  Coverage   95.05%   95.06%           
=======================================
  Files          98       98           
  Lines        4245     4253    +8     
=======================================
+ Hits         4035     4043    +8     
  Misses        210      210           

Collaborator

@WillNilges WillNilges left a comment

One small suggestion.

Collaborator

Does this code run synchronously anywhere that an exponential backoff like this could be a problem? What if we sent the Slack messages through the Celery worker instead?

If we want Slack messages sent during Join Form processing, for example, a blocking backoff could be problematic.
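
To make the concern concrete, here is a rough sketch of handing delivery off so the caller never waits on retries. It uses a standard-library thread and queue purely as a stand-in for a Celery task, not the Celery API itself; `BackgroundNotifier` and `deliver` are hypothetical names and nothing here is taken from the meshapi codebase.

```python
import queue
import threading
from typing import Callable, Optional


class BackgroundNotifier:
    """Queue messages and deliver them on a separate worker thread."""

    def __init__(self, deliver: Callable[[str], None]) -> None:
        # `deliver` is a hypothetical function that sends one message
        # and may block (for example, while retrying with backoff).
        self._deliver = deliver
        self._queue: "queue.Queue[Optional[str]]" = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def notify(self, message: str) -> None:
        # Returns immediately; the request handler is not blocked
        self._queue.put(message)

    def close(self) -> None:
        # None is a sentinel telling the worker to stop after draining
        self._queue.put(None)
        self._thread.join()

    def _run(self) -> None:
        while True:
            message = self._queue.get()
            if message is None:
                break
            self._deliver(message)
```

In this arrangement something like the Join Form handler would call `notify(...)` and return right away, while any backoff delays happen on the worker side.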

3 participants