Describe the bug

sglang/python/sglang/srt/model_executor/cuda_graph_runner.py
Lines 103 to 105 in 2125898

Since torch 2.5, we should not need such a large cache size limit. Could someone double-check and remove the override?
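For reference, the override in question is roughly of this form (a sketch of the usual dynamo override; the exact value set in cuda_graph_runner.py may differ):

```python
import torch._dynamo

# torch.compile caches one compiled entry per guarded input configuration;
# the stock cache_size_limit is small, so capturing CUDA graphs over many
# batch sizes used to require raising it. Illustrative value only.
torch._dynamo.config.cache_size_limit = 1024
```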
Reproduction

NA

Environment

NA
It was added in #2069. As I remember, we need to set it; otherwise something with FlashInfer will fail.
We could test whether a reduced limit is also compatible, rather than deleting the override outright.
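A minimal way to probe that (a sketch, assuming the server is launched with torch.compile enabled; the candidate value is arbitrary):

```python
import torch
import torch._dynamo as dynamo

dynamo.config.cache_size_limit = 64       # candidate value to test in place of the current override
torch._logging.set_logs(recompiles=True)  # surface recompile reasons / cache-limit hits in the logs

# ... then run the FlashInfer + torch.compile workload as usual and watch for
# "cache size limit reached" recompile warnings or failures.
```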
I see. Maybe a better way is to make the FlashInfer kernels torch.compile-compatible?
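For what that could look like: since torch 2.4, an opaque kernel can be wrapped as a custom op so that dynamo traces through it without graph breaks or extra guards. A minimal sketch (the op name, shapes, and placeholder body are hypothetical, not FlashInfer's actual API):

```python
import torch

@torch.library.custom_op("sgl_demo::fused_attention", mutates_args=())
def fused_attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
    # Stand-in for a call into an opaque (e.g. FlashInfer) kernel.
    return torch.softmax(q @ k.transpose(-1, -2), dim=-1) @ v

@fused_attention.register_fake
def _(q, k, v):
    # Shape/dtype propagation so torch.compile can trace the op
    # without executing the real kernel.
    return torch.empty_like(q)

@torch.compile(fullgraph=True)
def attn_block(q, k, v):
    return fused_attention(q, k, v)
```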