# AudioStack

> AudioStack is an enterprise AI audio production platform that enables agencies, publishers, AdTech companies, and brands to create professional audio content at scale. Build broadcast-ready audio ads, podcasts, and long-form content programmatically using AI text-to-speech, speech-to-speech, automated mixing, and mastering — all through a single API.

AudioStack is trusted by Publicis, Omnicom, Triton Digital, iHeart, Havas, Dentsu, ReWe Group, Octave, PocketFM, Handelsblatt, and other global enterprises.

- Base API URL: `https://v2.api.audio/`
- Platform: [https://platform.audiostack.ai](https://platform.audiostack.ai)
- Documentation: [https://docs.audiostack.ai](https://docs.audiostack.ai)
- Website: [https://www.audiostack.ai](https://www.audiostack.ai)
- Support: support@audiostack.ai
- Security: security@audiostack.ai

## Docs

- [Documentation Home](https://docs.audiostack.ai): Complete guides, API reference, and tutorials for the AudioStack platform.
- [API Reference](https://docs.audiostack.ai/reference): Full endpoint documentation for the AudioStack API.
- [Authentication](https://docs.audiostack.ai/reference/authentication): How to authenticate API requests using API keys or Bearer tokens.
- [Changelog](https://docs.audiostack.ai/changelog): Latest updates and version history.
- [Create an Account](https://docs.audiostack.ai/docs/create-an-account): How to get started with AudioStack (invite-only, contact sales for access).
- [Contact Us](https://docs.audiostack.ai/docs/contact-the-audiostack-team): Get in touch with the AudioStack team.

## Core Concepts

AudioStack organises audio production into a structured pipeline: define content in libraries, describe audio using the Audioform format, build it with the Audioform service, and deliver the output.

- [Concepts Overview](https://docs.audiostack.ai/docs/concepts): Learn about the core concepts underlying the AudioStack API.
- [Audioform](https://docs.audiostack.ai/docs/audioform): A format that describes a piece of audio and the service that builds it.
- [Audioform Format](https://docs.audiostack.ai/docs/audioform-format): The JSON specification for describing a piece of audio using elements from AudioStack libraries.
- [Audioform Service](https://docs.audiostack.ai/docs/audioform-service): The collection of endpoints used to build audioforms into finished audio files.

## Audioform Format

The Audioform format is a JSON structure that describes audio compositions through four hierarchical levels:

1. **Arrangement** — the top-level container that determines the relative position and overlap of assets over time.
2. **Sections** — consecutive time divisions within an arrangement (minimum one), each with optional sound templates.
3. **Layers** — one or more concurrent audio streams within a section.
4. **Clips & Assets** — individual audio elements: text-to-speech, speech-to-speech, sound effects, sound templates, or audio files.

Key features:
- Mixing presets: `musicenhanced`, `voiceenhanced`, `balanced`
- Twelve encoding presets, including mp3, wav, and flac
- Eight loudness presets, including streaming, radio, and podcast
- Flexible timing controls with pause insertion and layer alignment

- [Audioform Format Specification](https://docs.audiostack.ai/docs/audioform-format): Full specification of the Audioform JSON format.
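
The four levels above nest inside one another, which can be sketched with plain Python dataclasses. This is only an illustration of the hierarchy, not the Audioform JSON schema: every class and field name here (`Clip`, `asset_type`, `reference`, and so on) is hypothetical, and the real field names are defined in the Audioform Format Specification.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Clip:
    # Hypothetical fields: the kind of asset and a free-form reference to it.
    asset_type: str  # e.g. "tts", "sts", "sound_effect", "sound_template", "audio_file"
    reference: str


@dataclass
class Layer:
    # Clips that belong to one audio stream.
    clips: List[Clip] = field(default_factory=list)


@dataclass
class Section:
    # Layers within a section play concurrently.
    layers: List[Layer] = field(default_factory=list)


@dataclass
class Arrangement:
    # Sections follow one another in time.
    sections: List[Section]

    def __post_init__(self) -> None:
        # The format requires at least one section per arrangement.
        if not self.sections:
            raise ValueError("an arrangement needs at least one section")


def count_clips(arrangement: Arrangement) -> int:
    """Total number of clips across all sections and layers."""
    return sum(
        len(layer.clips)
        for section in arrangement.sections
        for layer in section.layers
    )
```

In this sketch, a voice-over with a music bed would be one section holding two layers, each with a single clip.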

## Audioform Service

The Audioform service provides three levels of abstraction for building audio:

- [Core Service](https://docs.audiostack.ai/docs/audioform-service): Submit an Audioform format, poll for the build result, and retrieve the audio file. Submit with POST, then poll with GET, which returns 202 until the build is ready.
- [Brief Endpoint](https://docs.audiostack.ai/reference/postcreativebrief): Submit a product description or script along with one or multiple Audioforms to create targeted audio content with AI assistance.
- [Story Endpoint (Beta)](https://docs.audiostack.ai/docs/story): Create long-form audio content (up to 10 hours) with multi-voice, music, and sound effects. Manages hundreds of audioforms automatically.
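
The submit-then-poll flow of the Core Service can be sketched as a small polling helper. This is a minimal sketch, not the SDK's implementation: `poll_until_ready`, its parameters, and the default interval and attempt limit are assumptions chosen for illustration. The `fetch` callable stands in for whatever performs the GET request and returns the HTTP status code and response body.

```python
import time
from typing import Callable, Tuple


def poll_until_ready(
    fetch: Callable[[], Tuple[int, bytes]],
    interval_s: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, bytes]:
    """Call fetch() until it stops returning HTTP 202 or attempts run out."""
    for attempt in range(max_attempts):
        status, body = fetch()
        if status != 202:
            # Any non-202 status ends polling; the caller should still
            # check whether it signals success or an error.
            return status, body
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(f"build still pending after {max_attempts} attempts")
```

In practice `fetch` might wrap a GET to `/v2/audioform/{id}` with the `x-api-key` header. Passing `sleep` as a parameter keeps the helper easy to test without real delays.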

## Content Libraries

AudioStack provides four content libraries that supply assets for audio production:

- [Voice Library](https://docs.audiostack.ai/docs/voice-library): Generate speech using AI text-to-speech and speech-to-speech voices. Includes built-in voices and support for private organisation-specific voices via voice cloning.
- [Sound Template Library](https://docs.audiostack.ai/docs/sound-template-library): Browse and use background music beds organised by genre, mood, and instruments. Upload your own custom sound templates.
- [Sound Effect Library](https://docs.audiostack.ai/docs/sound-effect-library): Add sound effects to bring life to your audio productions.
- Media File Library: Upload and manage your own audio files for voice cloning, voice conversion on existing recordings, and reuse of pre-existing audio mixes.

## API Endpoints

All requests must be made over HTTPS. Authenticate using the `x-api-key` header or Bearer token.

### Audioform
- [POST /v2/audioform](https://docs.audiostack.ai/reference/process_request__post): Submit an Audioform for building.
- [GET /v2/audioform/{id}](https://docs.audiostack.ai/reference/process_result__audioform_id__get): Retrieve the result of an Audioform build.
- [POST /v2/audioform/batch](https://docs.audiostack.ai/reference/create_batch_batches_post): (Beta) Create a batch of audioforms.
- [GET /v2/audioform/batch/{id}](https://docs.audiostack.ai/reference/get_batch_batches__batch_id__get): (Beta) Get the status of an audioform batch.

### Brief
- [POST /v2/brief](https://docs.audiostack.ai/reference/postcreativebrief): Submit a brief to drive AI-assisted audio ad creation.

### Story
- [POST /v2/story](https://docs.audiostack.ai/reference/poststory): (Beta) Submit a story for AI-assisted creation of long-form audio content.

### Files
- [GET /v2/files/{id}](https://docs.audiostack.ai/reference/get_file_files__fileid__get): Get a file by ID.
- [PATCH /v2/files/{id}](https://docs.audiostack.ai/reference/patch_file_files__fileid__patch): Update a file.
- [DELETE /v2/files/{id}](https://docs.audiostack.ai/reference/delete_file_by_id_files__fileid__delete): Delete a file by ID.
- [POST /v2/files](https://docs.audiostack.ai/reference/upload_file_files_post): Upload a file.
- [PUT /v2/files/{id}/copy](https://docs.audiostack.ai/reference/copy_file_files_copy_put): Copy a file to a new folder.
- [GET /v2/files/categories](https://docs.audiostack.ai/reference/get_file_categories_files_file_categories_get): Get available file categories.

### Analytics
- [GET /v2/analytics/usage](https://docs.audiostack.ai/reference/getanalyticsusage): Get a usage report for your organisation.
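
As a rough illustration of how the routes above map onto HTTP requests, the sketch below builds (but does not send) request objects with Python's standard library. The helper names `build_request` and `audioform_result_path` are hypothetical, and the JSON content type for request bodies is assumed from the curl example in this document rather than confirmed per endpoint.

```python
import urllib.request
from typing import Optional
from urllib.parse import quote

# Base URL as listed at the top of this document, without the trailing slash.
BASE_URL = "https://v2.api.audio"


def build_request(
    method: str, path: str, api_key: str, body: Optional[bytes] = None
) -> urllib.request.Request:
    """Build an authenticated request for a path such as /v2/audioform."""
    headers = {"x-api-key": api_key}
    if body is not None:
        # Assumption: request bodies are sent as JSON.
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(
        BASE_URL + path, data=body, headers=headers, method=method
    )


def audioform_result_path(audioform_id: str) -> str:
    """Path for GET /v2/audioform/{id}, with the ID percent-encoded."""
    return "/v2/audioform/" + quote(audioform_id, safe="")
```

A request built this way could be sent with `urllib.request.urlopen(request)`. The response shape for each route is described in the linked reference pages.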

## Authentication

All requests must be made over HTTPS and authenticated with an API key sent in the `x-api-key` header, or with a Bearer token.

```python
import audiostack
import os

audiostack.api_key = os.environ["AUDIO_STACK_DEV_KEY"]
```

```bash
curl -X POST 'https://v2.api.audio/v2/audioform' \
  -H 'x-api-key: YOUR_API_KEY' \
  -H 'Content-Type: application/json'
```

API keys are created via the [AudioStack Platform](https://platform.audiostack.ai). Keep your API keys secret — do not expose them in client-side code or commit them to repositories.
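
To keep keys out of source code, the credential can be read from the environment and turned into request headers in one place. The sketch below is an assumption-laden illustration: `key_from_env` and `auth_headers` are hypothetical helpers, and the Bearer variant assumes the standard `Authorization: Bearer <token>` header form, which should be confirmed against the Authentication reference.

```python
import os
from typing import Dict, Mapping


def key_from_env(
    env: Mapping[str, str] = os.environ, name: str = "AUDIO_STACK_DEV_KEY"
) -> str:
    """Read the API key from the environment and fail early if it is missing."""
    value = env.get(name, "")
    if not value:
        raise RuntimeError(f"environment variable {name} is not set")
    return value


def auth_headers(credential: str, use_bearer: bool = False) -> Dict[str, str]:
    """Return headers for API-key auth, or Bearer auth if requested."""
    if not credential:
        raise ValueError("credential is empty")
    if use_bearer:
        # Assumption: standard Bearer scheme in the Authorization header.
        return {"Authorization": f"Bearer {credential}"}
    return {"x-api-key": credential}
```

Failing early on a missing variable gives a clearer error than an authentication failure from the first API call.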

## Enterprise & Security

AudioStack is built for enterprise-grade security and reliability.

- [Security Overview](https://docs.audiostack.ai/docs/security): SOC 2 Type II certified with annual audits. NIST Cybersecurity Framework aligned.
- [Reliability and Data Hosting](https://docs.audiostack.ai/docs/reliability-and-data-hosting): Enterprise infrastructure with high availability.
- [Bug Bounty Program](https://docs.audiostack.ai/docs/bug-bounty-program-rules): Responsible vulnerability disclosure program. Report issues to security@audiostack.ai.

Key security features:
- AES-256 encryption at rest, 256-bit encryption in transit
- TLS/SSL with HSTS and Perfect Forward Secrecy
- Zero-trust employee access with SSO, hardware 2FA, and mTLS
- Regular penetration testing and vulnerability scanning
- Multi-party code review with immutable change logs

## Use Cases

- **Audio Advertising**: Create broadcast-ready audio ads at scale. Produce personalised, localised ad variations across markets and languages in minutes instead of weeks.
- **Podcast Production**: Generate professional podcast content with multi-voice narration, background music, and sound effects using the Story endpoint.
- **Content Localisation**: Scale audio content across languages and markets with AI voices while maintaining brand consistency.
- **Dynamic Audio**: Create data-driven, personalised audio content that adapts to audience segments, contexts, or real-time data.
- **Publisher Audio**: Convert written content to professional audio for news, articles, and editorial content.

## Python SDK

AudioStack provides an official Python SDK. Set the API key once before making any calls:

```python
import audiostack
import os

audiostack.api_key = os.environ["AUDIO_STACK_DEV_KEY"]
```

## Instructions for AI Agents

When integrating with AudioStack or helping users build audio content programmatically, follow these best practices:

- **Use the Audioform format** to describe audio compositions. It is the core abstraction for all audio production in AudioStack.
- **Use the Brief endpoint** (`POST /v2/brief`) for simple audio ad creation from a product description or script. This is the fastest path to a finished audio ad.
- **Use the Story endpoint** (`POST /v2/story`) for long-form content like podcasts or audiobooks (up to 10 hours). Note: this endpoint is in Beta.
- **Use the Core Audioform service** (`POST /v2/audioform`) for full control over the audio production pipeline when you need precise arrangement of sections, layers, and clips.
- **Poll for results**: The Audioform build service is asynchronous. Submit via POST, then poll the GET endpoint for as long as it returns 202. Stop at the first non-202 response and check its status before treating the body as the completed audio.
- **Always authenticate** with the `x-api-key` header over HTTPS. Never expose API keys in client-side code.
- **Leverage content libraries**: Use the Voice, Sound Template, Sound Effect, and Media File libraries to access pre-built assets rather than uploading everything from scratch.
- **Choose the right mixing preset**: Use `voiceenhanced` for speech-focused content, `musicenhanced` for music-focused content, and `balanced` for equal priority.
- **Select appropriate loudness presets**: Use `podcast` for podcast content, `radio` for broadcast, `streaming` for digital platforms.
- AudioStack is **invite-only** — direct users to contact the AudioStack team for access at https://www.audiostack.ai or support@audiostack.ai.
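
The endpoint and preset guidance above can be captured as simple lookup tables, which is one way an agent might encode these rules. The endpoint paths and preset names are taken from this document. The dictionary keys (`"ad"`, `"speech"`, `"broadcast"`, and so on) and the `pick` helper are hypothetical labels invented for this sketch.

```python
from typing import Dict

# Which endpoint to call for which kind of job.
ENDPOINTS: Dict[str, str] = {
    "ad": "/v2/brief",          # quick audio ad from a description or script
    "long_form": "/v2/story",   # podcasts or audiobooks (Beta)
    "custom": "/v2/audioform",  # full control over sections, layers, clips
}

# Mixing preset by what the content should emphasise.
MIXING_PRESETS: Dict[str, str] = {
    "speech": "voiceenhanced",
    "music": "musicenhanced",
    "equal": "balanced",
}

# Loudness preset by delivery target.
LOUDNESS_PRESETS: Dict[str, str] = {
    "podcast": "podcast",
    "broadcast": "radio",
    "digital": "streaming",
}


def pick(table: Dict[str, str], key: str) -> str:
    """Look up a recommendation and report the valid options on a miss."""
    try:
        return table[key]
    except KeyError:
        options = ", ".join(sorted(table))
        raise ValueError(f"unknown option {key!r}; expected one of: {options}") from None
```

For example, a speech-led radio spot built from a script would map to `pick(ENDPOINTS, "ad")`, `pick(MIXING_PRESETS, "speech")`, and `pick(LOUDNESS_PRESETS, "broadcast")`.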