Conversation

Collaborator

@AlexsanderHamir AlexsanderHamir commented Sep 25, 2025

Title

Pre-load Users

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory (adding at least 1 test is a hard requirement - see details)
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🆕 New Feature

Changes

If a user already has a database and starts or restarts the proxy server, they can now choose to load the most recent users into memory.
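To make the idea concrete, here is a minimal sketch of the preload pattern this PR describes: at startup, fetch the N most recently created users and seed an in-memory cache so the first wave of concurrent requests does not all hit the database. The names (`InMemoryCache`, `find_recent_users`, `preload_recent_users`) are hypothetical stand-ins, not LiteLLM's actual API.

```python
import asyncio


class InMemoryCache:
    """Tiny stand-in for the proxy's in-memory user cache."""

    def __init__(self):
        self._store = {}

    def set(self, key, value):
        self._store[key] = value

    def get(self, key):
        return self._store.get(key)


async def preload_recent_users(db, cache, limit=100):
    """Load up to `limit` of the most recently created users into the cache.

    `db.find_recent_users` is a hypothetical helper standing in for the real
    Prisma query ordered by creation time, descending.
    """
    users = await db.find_recent_users(limit=limit)
    for user in users:
        cache.set(user["user_id"], user)
    return len(users)
```

After this runs, lookups for recently created users are served from memory instead of triggering a database round trip.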

Observations

This feature has room to be expanded upon, e.g. loading the most active users instead of the most recent ones.

Performance Improvements

  • When this configuration is enabled, it prevents the initial latency spikes seen in load balancing tests, where all concurrent users would otherwise hit the database directly.
  • Latency was observed to be much more stable, and it recovers quickly from spikes.

With Cache Warmup

[screenshot: latency with cache warmup]

Without Cache Warmup

[screenshot: latency without cache warmup]


vercel bot commented Sep 25, 2025

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Preview | Comments | Updated (UTC) |
| ------- | ---------- | ------- | -------- | ------------- |
| litellm | Error | Error | | Sep 26, 2025 6:25pm |

Contributor

@ishaan-jaff ishaan-jaff left a comment


can you add details on perf improvement this PR showed on testing @AlexsanderHamir ?

```diff
 response = await self.db.litellm_endusertable.find_many(
-    where={"budget_id": {"in": budget_id_list}}
+    where={"budget_id": {"in": budget_id_list}},
+    order={"litellm_budget_table": {"created_at": "desc"}},
```
Contributor


why change this ?

Collaborator Author


It was a mistake, thank you for catching it.

```python
)

### PRELOAD USERS INTO CACHE ###
ProxyStartupEvent._start_user_preload_background_task(
```
Contributor


can this be an asyncio.create_task, so it does not block startup
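A minimal sketch of what the reviewer is suggesting, with hypothetical function names: `asyncio.create_task` schedules the preload coroutine on the running event loop and returns immediately, so the rest of startup proceeds without waiting for the database fetch.

```python
import asyncio


async def _preload_users():
    # Stand-in for the real DB fetch + cache fill.
    await asyncio.sleep(0)
    return "done"


async def startup():
    # create_task schedules the coroutine on the running loop and returns
    # immediately; startup continues while the preload runs in the background.
    preload_task = asyncio.create_task(_preload_users())
    # ... remaining startup work runs concurrently with the preload ...
    return preload_task
```

One caveat with this pattern: a reference to the task should be kept (as `startup` does here) so it is not garbage-collected before it finishes.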

Contributor

@ishaan-jaff ishaan-jaff left a comment


reviewed


```python
### PRELOAD USERS INTO CACHE ###
if prisma_client is not None and general_settings is not None:
    preload_limit = general_settings.get("preload_users_limit", 0)
```
Contributor


@AlexsanderHamir we want this to run by default. the current code requires the user to opt into this by setting it on general_settings
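A sketch of the default-on behavior being requested, with hypothetical names: the preload limit falls back to a constant, so the feature runs even when nothing is set in `general_settings`, while an explicit setting can still override it.

```python
DEFAULT_CACHE_WARMUP_USERS = 100  # hypothetical constant name


def resolve_preload_limit(general_settings=None):
    # Default-on: the constant applies even when general_settings is absent
    # or silent; an explicit "preload_users_limit" entry still wins.
    if general_settings and "preload_users_limit" in general_settings:
        return general_settings["preload_users_limit"]
    return DEFAULT_CACHE_WARMUP_USERS
```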

Contributor

@ishaan-jaff ishaan-jaff left a comment


reviewed

```python
    description="[DEPRECATED] Use 'user_header_mappings' instead. When set, the header value is treated as the end user id unless overridden by user_header_mappings.",
)
user_header_mappings: Optional[List[UserHeaderMapping]] = None
preload_users_limit: Optional[int] = Field(
```
Contributor


we don't need this

```python
MAX_SIZE_PER_ITEM_IN_MEMORY_CACHE_IN_KB = int(
    os.getenv("MAX_SIZE_PER_ITEM_IN_MEMORY_CACHE_IN_KB", 1024)
)  # 1MB = 1024KB
_DEFAULT_CACHE_WARMUP_USERS = 100
```
Contributor


call it DEFAULT_CACHE_WARMUP_USERS and allow it to be overrideable using env vars
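The requested change can be sketched like this, mirroring the `MAX_SIZE_PER_ITEM_IN_MEMORY_CACHE_IN_KB` pattern shown in the snippet above (the environment variable name is an assumption following that convention):

```python
import os

# Env-overridable default: drop the leading underscore from the constant name
# and allow DEFAULT_CACHE_WARMUP_USERS to be set via an environment variable,
# falling back to 100 when unset.
DEFAULT_CACHE_WARMUP_USERS = int(os.getenv("DEFAULT_CACHE_WARMUP_USERS", 100))
```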

```python
if prisma_client is not None:
    default_preload_limit = _DEFAULT_CACHE_WARMUP_USERS
    preload_limit = (
        general_settings.get("preload_users_limit", default_preload_limit)
```
Contributor


no need for general settings just use the constant
