RDP GX2-L40S 2-GPU Server
Low-latency 2-GPU inference & RAG node — serves 7B–13B models, NVIDIA-Certified and Make-in-India.
Overview
RDP GX2-L40S 2-GPU Server is a compact, air-cooled inference and RAG node in the GPU Servers line of RDP GPU Mart — built for teams putting 7B–13B models into low-latency production with Make-in-India economics and support.
Two NVIDIA L40S GPUs provide 96 GB of GDDR6 in total (48 GB each) for inference in a quiet 2U chassis, with an upgrade path to the GX4 and GX8 servers in this line as your workloads grow.
Key specifications
| Specification | Value |
|---|---|
| GPUs | 2× NVIDIA L40S 48GB |
| GPU memory | 96 GB GDDR6 |
| CPU | 2× Intel Xeon Silver 4516Y |
| System memory | 512 GB DDR5 ECC (up to 2 TB) |
| Storage | 2× 1.92 TB NVMe + 4 bays |
| Networking | 2× 25GbE (ConnectX-6) |
| Cooling | Air-cooled |
| Form factor | 2U rackmount |
| Power | 2× 1600W redundant (1+1) |
| Management | BMC · Redfish · IPMI 2.0 |
| OS / stack | Ubuntu 22.04, RHEL 9 · NVIDIA AI Enterprise |
| Warranty | 3-year pan-India onsite |
Spec values shown are representative for this preview; final values will be taken from the live product data.
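The management row lists Redfish, which exposes the server inventory over HTTPS as JSON. As a minimal sketch, assuming a reachable BMC, the snippet below builds the standard Redfish systems collection URL and extracts member paths from a collection document. The host name `bmc.example.internal` and the sample payload are hypothetical; a real BMC also requires authentication, which is left out here.

```python
import json

# Standard Redfish path for the computer-systems collection.
REDFISH_SYSTEMS_PATH = "/redfish/v1/Systems"


def systems_url(bmc_host: str) -> str:
    """Build the HTTPS URL of the systems collection on a BMC."""
    return f"https://{bmc_host}{REDFISH_SYSTEMS_PATH}"


def member_paths(collection_json: str) -> list:
    """Return the @odata.id path of every member in a Redfish collection."""
    doc = json.loads(collection_json)
    return [member["@odata.id"] for member in doc.get("Members", [])]


# Hypothetical payload shaped like a Redfish collection response.
sample = '{"Members": [{"@odata.id": "/redfish/v1/Systems/1"}]}'

print(systems_url("bmc.example.internal"))
print(member_paths(sample))
```

Each returned path can then be fetched in turn to read details such as power state or health for that system.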
Workload fit
Sized for production inference and RAG on 7B–13B models, computer vision, and small fine-tunes. It is a datacenter-grade entry point, with room to move up to the larger servers in this line.
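The 7B–13B sizing can be sanity-checked with simple arithmetic: weight memory is roughly the parameter count times the bytes per parameter. The sketch below is a rough estimate, not a sizing tool. The 48 GB per GPU and the GPU count come from the spec table; the `overhead_fraction` allowance for KV cache and activations is an assumed placeholder that varies with batch size and context length.

```python
# Rough GPU-memory estimate for serving a dense model.
GPU_MEMORY_GB = 48  # per L40S, from the spec table
GPU_COUNT = 2       # from the spec table

# Bytes per parameter at common serving precisions.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billion: float, precision: str = "fp16") -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * BYTES_PER_PARAM[precision]


def fits(params_billion: float, precision: str = "fp16",
         overhead_fraction: float = 0.3, gpus: int = 1) -> bool:
    """True if weights plus an assumed overhead fit in the given GPUs.

    overhead_fraction is an assumption standing in for KV cache and
    activations; tune it for the real batch size and context length.
    """
    needed = weight_memory_gb(params_billion, precision) * (1 + overhead_fraction)
    return needed <= GPU_MEMORY_GB * gpus


for size in (7, 13):
    print(f"{size}B fp16: {weight_memory_gb(size):.0f} GB weights, "
          f"fits on one GPU: {fits(size)}")
```

Under these assumptions a 13B model at FP16 needs about 26 GB for weights, so each L40S can hold its own copy, which leaves the second GPU free for a replica or a separate embedding or reranking model in a RAG pipeline.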
Why RDP GPU Mart
- Make in India OEM — 28,000 sq ft Hyderabad facility, 14 years, 300,000+ devices shipped.
- Sovereign-ready: India data residency (DPDP), MeitY-recognised, ISO 27001 / SOC 2 paths.
- INR-transparent: GST invoice, HSN 8471, CGST/SGST or IGST, pan-India onsite SLA.
- Available on GeM for government and PSU procurement.
FAQ
How is it billed and supported?
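Billing is in INR with a GST invoice (HSN 8471), charged as CGST/SGST or IGST depending on the state of supply. EMI, lease, and GPUaaS options are available, along with 3-year pan-India onsite support. The server is also available on GeM.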
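The CGST/SGST versus IGST split follows from whether supplier and buyer are in the same state: an intra-state sale carries CGST and SGST at half the rate each, and an inter-state sale carries IGST at the full rate. The sketch below illustrates that rule only. The function name, the taxable value, and the 18 percent rate are illustrative assumptions, so confirm the applicable rate and amounts against the actual quote.

```python
from decimal import Decimal, ROUND_HALF_UP

TWO_PLACES = Decimal("0.01")


def gst_breakup(taxable_value: str, rate_percent: str,
                supplier_state: str, buyer_state: str) -> dict:
    """Split GST into CGST/SGST (same state) or IGST (different states)."""
    value = Decimal(taxable_value)
    rate = Decimal(rate_percent)
    zero = Decimal("0.00")
    if supplier_state == buyer_state:
        # Intra-state: CGST and SGST are each charged at half the rate.
        half = (value * rate / 2 / 100).quantize(TWO_PLACES, rounding=ROUND_HALF_UP)
        return {"CGST": half, "SGST": half, "IGST": zero}
    # Inter-state: IGST is charged at the full rate.
    full = (value * rate / 100).quantize(TWO_PLACES, rounding=ROUND_HALF_UP)
    return {"CGST": zero, "SGST": zero, "IGST": full}


# Hypothetical taxable value and assumed rate, for illustration only.
print(gst_breakup("1000000", "18", "Telangana", "Telangana"))
print(gst_breakup("1000000", "18", "Telangana", "Karnataka"))
```

`Decimal` is used instead of floats so the paise round the same way on every run.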
Can I start as a pilot and scale to a cluster?
Yes — the fabric and management stack let you add nodes and a head node later. Talk to an architect for the reference design.
What software stack is included?
Ubuntu 22.04 or RHEL 9 with NVIDIA AI Enterprise, with drivers and CUDA validated. Slurm or Kubernetes can be set up on request.
Other GPU Servers in this line
| | GX4 4-GPU | GX4 Plus | GX4 Max | GX8 HGX |
|---|---|---|---|---|
| GPUs | 4× L40S | 4× H100 PCIe | 4× H100 SXM | 8× H100 SXM |
| Model fit | up to 34B | 34B–70B | 70B | 70B–405B |
| Cooling | Air | Air | Air / DLC | Direct liquid |
| Price | On request | On request | On request | On request |
Designing a GPU cluster, not just one server?
Talk to our solution architects — multi-node fabric design, financing & GPU-as-a-Service, India-onsite SLA.