Horizon Alert
Summary of the vulnerability and why it matters
This advisory details a critical vulnerability in NVIDIA's GEN3C technology affecting its inference API server. An unauthenticated attacker can exploit this by sending a specially crafted request to specific endpoints, potentially leading to remote code execution on the server. The main concern is confirming if this technology is in use and if it's exposed to external networks.
- Unauthenticated code execution in NVIDIA inference servers.
- Critical vulnerability could allow attackers remote control.
- Confirm relevance and exposure of NVIDIA inference technology.
Attack Path
How an attacker could exploit the issue
Attackers can reach an NVIDIA SIL GEN3C inference API server over the network and send specially crafted HTTP requests to trigger a deserialization vulnerability. This vulnerability, located in the `/request-inference` and `/seed-model` endpoints, can lead to remote code execution on the server, allowing an attacker to control the inference process.
- Unauthenticated network access required.
- Deserializing untrusted HTTP request bodies.
- Remote code execution on the inference server.
Live Threat
Current exploitation, exposure, and threat context
The inference API server in NVIDIA SIL's GEN3C, when exposed to the network and without authentication, could allow attackers to execute arbitrary code by sending specially crafted data to its `/request-inference` or `/seed-model` endpoints. This occurs because the server deserializes raw HTTP request bodies using Python's pickle.loads() without proper validation. The affected system could be compromised, leading to unauthorized code execution as the inference process.
- System code execution.
- Unauthenticated network requests.
- Compromise of inference process.
Operational Fix
Recommended remediation, mitigation, and detection steps
The NVIDIA SIL GEN3C inference API server is susceptible to unauthenticated remote code execution due to insecure deserialization. Real-world responsibility likely falls to the platform or infrastructure teams managing the SIL deployment, in coordination with security teams for exposure assessment and vendor management if the GEN3C is a third-party component. The first practical step is to identify all instances of GEN3C, determine their network reachability and business criticality, and then plan remediation based on risk, potentially involving vendor engagement for a confirmed fix.
- Platform/Infrastructure owns the vulnerability.
- Verify network exposure and business criticality.
- Plan remediation based on risk and vendor advice.