# Vocal Isolator

> Separates the vocals from the background music of a video or audio file

## Request

Make a `POST` request to the endpoint below and pass the required parameters in the request body.

```curl curl theme={null}
curl --request POST 'https://modelslab.com/api/v6/voice/vocal_isolator' \
  --header 'Content-Type: application/json'
```

## Body

<Note>
  Either `init_video` or `init_audio` is required. If both are provided, `init_video` takes precedence.
</Note>

```json json theme={null}
{
    "key": "your_api_key",
    "init_video": "https://assets.modelslab.ai/generations/a85db55c-dca2-415c-97a5-51192916e61b.mp3",
    "seed": 234322,
    "base64": false,
    "webhook": null,
    "track_id": null
}
```
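
The body above can be assembled and sent from Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_payload` and `send_request` are hypothetical, and the check for a missing media URL simply mirrors the note above rather than any documented server-side behavior.

```python
import json
import urllib.request

ENDPOINT = "https://modelslab.com/api/v6/voice/vocal_isolator"


def build_payload(key, init_video=None, init_audio=None, **options):
    """Assemble the request body; raise if neither media URL is given.

    Hypothetical helper: field names follow the Body section, but the
    client-side validation is an assumption added for illustration.
    """
    if not init_video and not init_audio:
        raise ValueError("Either init_video or init_audio is required")
    payload = {"key": key, "base64": False, "webhook": None, "track_id": None}
    if init_video:
        payload["init_video"] = init_video
    if init_audio:
        payload["init_audio"] = init_audio
    payload.update(options)  # optional fields such as seed=234322
    return payload


def send_request(payload, timeout=60):
    """POST the JSON body to the endpoint and return the decoded JSON reply."""
    request = urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `send_request(build_payload("your_api_key", init_audio="<audio URL>", seed=234322))` performs the actual network call. Keeping payload construction separate from sending makes the required-field check easy to exercise without contacting the API.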


## OpenAPI

````yaml POST /voice/vocal_isolator
openapi: 3.1.0
info:
  title: ModelsLab Voice API
  description: >-
    A comprehensive API for AI-driven voice and audio generation including
    text-to-speech, voice cloning, music generation, and audio processing
    capabilities
  license:
    name: MIT
  version: 6.0.0
servers:
  - url: https://modelslab.com/api/v6
security: []
paths:
  /voice/vocal_isolator:
    post:
      summary: Isolate vocals from audio
      description: Separates vocals from background music in audio or video files
      requestBody:
        required: true
        content:
          application/json:
            schema:
              $ref: '#/components/schemas/VocalIsolatorRequest'
      responses:
        '200':
          description: Vocal isolation response
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/VoiceResponse'
        '400':
          description: Bad request
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/Error'
components:
  schemas:
    VocalIsolatorRequest:
      type: object
      required:
        - key
      properties:
        key:
          type: string
          description: API key for authentication
        init_video:
          type: string
          format: uri
          description: Video URL to isolate vocals from
        init_audio:
          type: string
          format: uri
          description: Audio URL to isolate vocals from
        base64:
          type: boolean
          default: false
          description: Whether input is in base64 format
        seed:
          type: integer
          description: Random seed for reproducibility (0 for random)
        webhook:
          type: string
          format: uri
          description: URL to receive POST notification upon completion
        track_id:
          type: integer
          description: ID for webhook identification
    VoiceResponse:
      type: object
      properties:
        status:
          type: string
          enum:
            - success
            - processing
            - error
          description: Status of the voice generation
        generationTime:
          type: number
          description: Time taken to generate the audio in seconds
        id:
          type: integer
          description: Unique identifier for the voice generation
        output:
          type: array
          items:
            type: string
            format: uri
          description: Array of generated audio URLs
        proxy_links:
          type: array
          items:
            type: string
            format: uri
          description: Array of proxy audio URLs
        future_links:
          type: array
          items:
            type: string
            format: uri
          description: Array of future audio URLs for queued requests
        links:
          type: array
          items:
            type: string
            format: uri
          description: Array of audio URLs (voice cover response)
        meta:
          type: object
          description: Metadata about the audio generation including all parameters used
        eta:
          type: integer
          description: Estimated time for completion in seconds (processing status)
        message:
          type: string
          description: Status message or additional information
        tip:
          type: string
          description: Additional information or tips for the user
        fetch_result:
          type: string
          format: uri
          description: URL to fetch the result when processing
        audio_time:
          type: number
          description: Duration of the generated audio in seconds
    Error:
      type: object
      required:
        - status
        - message
      properties:
        status:
          type: string
          enum:
            - error
        message:
          type: string
          description: Error message description

````
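
The `VoiceResponse` schema above distinguishes three `status` values, and a caller usually needs to branch on them. The sketch below shows one way to do that in Python. The function name `interpret_response` and the shape of its return value are hypothetical, and this page does not specify how `fetch_result` should be requested, so the code only reports the URL and the `eta` rather than polling.

```python
def interpret_response(response):
    """Decide what to do next based on the `status` field of a VoiceResponse.

    Hypothetical helper: the returned dictionary layout is an assumption
    made for illustration, not part of the API.
    """
    status = response.get("status")
    if status == "success":
        # Finished: `output` holds the generated audio URLs.
        return {"action": "done", "urls": response.get("output", [])}
    if status == "processing":
        # Queued: the result can be fetched later from `fetch_result`.
        return {
            "action": "poll",
            "fetch_url": response.get("fetch_result"),
            "wait_seconds": response.get("eta", 0),
            "future_links": response.get("future_links", []),
        }
    # Covers status == "error" and any unexpected value.
    raise RuntimeError(response.get("message", "Unknown error"))
```

For a queued request, a caller would typically wait roughly `wait_seconds` before checking `fetch_url`, or rely on the `webhook` notification instead.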