Skip to content

Label Studio allows Server-Side Request Forgery in the S3 Storage Endpoint

High severity GitHub Reviewed Published Feb 14, 2025 in HumanSignal/label-studio • Updated Feb 14, 2025

Package

pip label-studio (pip)

Affected versions

< 1.16.0

Patched versions

1.16.0

Description

Description

Label Studio's S3 storage integration feature contains a Server-Side Request Forgery (SSRF) vulnerability in its endpoint configuration. When creating an S3 storage connection, the application allows users to specify a custom S3 endpoint URL via the s3_endpoint parameter. This endpoint URL is passed directly to the boto3 AWS SDK without proper validation or restrictions on the protocol or destination.

The vulnerability allows an attacker to make the application send HTTP requests to arbitrary internal services by specifying them as the S3 endpoint. When the storage sync operation is triggered, the application attempts to make S3 API calls to the specified endpoint, effectively making HTTP requests to the target service and returning the response in error messages.

This SSRF vulnerability enables attackers to bypass network segmentation and access internal services that should not be accessible from the external network. The vulnerability is particularly severe because error messages from failed requests contain the full response body, allowing data exfiltration from internal services.

Steps to reproduce

  1. Create an account in Label Studio

  2. Create a new project with basic configuration

  3. Create an S3 storage connection with the following configuration:

    {
      "project": 1,
      "title": "Test Storage",
      "bucket": "<filename>",
      "s3_endpoint": "http://internal-web",
      "use_blob_urls": true,
      "aws_access_key_id": "test",
      "aws_secret_access_key": "test"
    }
  4. Trigger a storage sync operation by sending a POST request to /api/storages/s3/[storage_id]/sync

The application will attempt to connect to the specified endpoint URL as if it were an S3 service. When the request fails due to invalid S3 API responses, the error message will contain the raw response from the internal service, allowing access to internal resources.

Mitigations

  • Implement strict validation of S3 endpoint URLs to allow only valid S3service endpoints
  • Add an allowlist of endpoint domains and protocols
  • Sanitize error messages to prevent leakage of sensitive information from failed requests
  • Consider implementing network-level controls to restrict outbound connections from the application server

Impact

This vulnerability has high severity as it allows authenticated users to make requests to arbitrary internal services from the application server, potentially exposing sensitive internal resources and bypassing network segmentation. The inclusion of response data in error messages makes this particularly effective for data exfiltration.

References

@jombooth jombooth published to HumanSignal/label-studio Feb 14, 2025
Published to the GitHub Advisory Database Feb 14, 2025
Reviewed Feb 14, 2025
Published by the National Vulnerability Database Feb 14, 2025
Last updated Feb 14, 2025

Severity

High

CVSS overall score

This score calculates overall vulnerability severity from 0 to 10 and is based on the Common Vulnerability Scoring System (CVSS).
/ 10

CVSS v3 base metrics

Attack vector
Network
Attack complexity
Low
Privileges required
None
User interaction
None
Scope
Changed
Confidentiality
High
Integrity
None
Availability
None

CVSS v3 base metrics

Attack vector: More severe the more the remote (logically and physically) an attacker can be in order to exploit the vulnerability.
Attack complexity: More severe for the least complex attacks.
Privileges required: More severe if no privileges are required.
User interaction: More severe when no user interaction is required.
Scope: More severe when a scope change occurs, e.g. one vulnerable component impacts resources in components beyond its security scope.
Confidentiality: More severe when loss of data confidentiality is highest, measuring the level of data access available to an unauthorized user.
Integrity: More severe when loss of data integrity is the highest, measuring the consequence of data modification possible by an unauthorized user.
Availability: More severe when the loss of impacted component availability is highest.
CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:C/C:H/I:N/A:N

EPSS score

Weaknesses

CVE ID

CVE-2025-25297

GHSA ID

GHSA-m238-fmcw-wh58

Credits

Loading Checking history
See something to contribute? Suggest improvements for this vulnerability.