Direct performance comparison between the RTX 4090 Pro and RTX 5090 across 21 standardized AI benchmarks collected from our production fleet. Testing shows the RTX 4090 Pro winning 3 out of 21 benchmarks (14% win rate), while the RTX 5090 wins 18 tests. All 21 benchmark results are automatically gathered from active rental servers, providing real-world performance data rather than synthetic testing.
In language model inference testing across 9 different models, the RTX 4090 Pro is 23% slower than the RTX 5090 on average. For deepseek-r1:32b inference, the RTX 4090 Pro reaches 45 tokens/s while the RTX 5090 achieves 71 tokens/s, making the RTX 4090 Pro significantly slower with a 37% deficit. Overall, the RTX 4090 Pro wins 1 out of 9 LLM tests with an average 28% performance difference, making the RTX 5090 the better option for LLM inference tasks.
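The 37% figure for deepseek-r1:32b is consistent with measuring the gap relative to the faster card's throughput. A minimal Python sketch of that arithmetic, assuming this is how the percentage is derived (the function name `throughput_deficit_pct` is hypothetical, not part of any benchmark tooling):

```python
def throughput_deficit_pct(slower: float, faster: float) -> float:
    """Percent by which `slower` trails `faster` for a higher-is-better
    metric such as tokens/s. Assumption: the deficit is expressed
    relative to the faster card's result."""
    if faster <= 0:
        raise ValueError("faster must be positive")
    return (faster - slower) / faster * 100.0


# deepseek-r1:32b figures quoted above: 45 tokens/s vs 71 tokens/s
deficit = throughput_deficit_pct(45.0, 71.0)
print(f"{deficit:.0f}%")  # 37%
```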
In AI image generation testing across 12 different Stable Diffusion models, the RTX 4090 Pro is 95% slower than the RTX 5090 in this category. On sd3.5-medium, the RTX 4090 Pro completes generations at 6.1 s/image while the RTX 5090 achieves 4.5 s/image, making the RTX 4090 Pro significantly slower with a 26% deficit. Across all 12 image generation benchmarks, the RTX 4090 Pro wins 2 tests with an average 95% performance difference, making the RTX 5090 the better choice for Stable Diffusion, SDXL, and Flux workloads.
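Because s/image is a lower-is-better metric, the 26% sd3.5-medium deficit appears to be computed on images per unit time rather than on raw seconds. A minimal Python sketch under that assumption (the function name `time_deficit_pct` is hypothetical):

```python
def time_deficit_pct(slower_s: float, faster_s: float) -> float:
    """Throughput deficit in percent, given seconds per image.
    Assumption: times are converted to rates (1 / seconds) and the gap
    is expressed relative to the faster card's rate, which simplifies
    to 1 - faster_s / slower_s."""
    if slower_s <= 0 or faster_s <= 0:
        raise ValueError("times must be positive")
    return (1.0 - faster_s / slower_s) * 100.0


# sd3.5-medium figures quoted above: 6.1 s/image vs 4.5 s/image
deficit = time_deficit_pct(6.1, 4.5)
print(f"{deficit:.0f}%")  # 26%
```

Measured instead as extra time per image, the same pair gives (6.1 - 4.5) / 4.5, or about 36%, so the convention matters when comparing percentages across sources.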
Our benchmarks are collected automatically from servers in our fleet equipped with RTX 4090 Pro and RTX 5090 GPUs, using standardized test suites.
Note: RTX 4090 Pro and RTX 5090 AI benchmark results may vary based on system load, configuration, and specific hardware revisions. The figures reported here are median values from multiple test runs on each GPU.