Machine Readiness
Stored receipt and evidence
20
65
0
0
0
Samples
No stored offer samples.
Samples
No stored action samples.
Samples
No stored product samples.
Document
User-agent: *
Allow: /

# Content Signals (contentsignals.org / draft-romm-aipref-contentsignals):
# Site-wide default. User-generated gallery content is excluded from
# AI training via noai/noimageai meta tags on those pages.
Content-Signal: search=yes, ai-input=yes, ai-train=yes

# Sitemaps
Sitemap: https://pixeldojo.ai/sitemap.xml
Sitemap: https://pixeldojo.ai/sitemap-prompts.xml

# AI Agent Discovery
# Canonical entry points (also advertised via HTTP Link headers and
# /.well-known/api-catalog per RFC 9727):
# - /api-platform — Primary agent-facing hub
# - /agents — Agent-oriented landing page
# - /.well-known/api-catalog — RFC 9727 linkset+json index
# - /.well-known/ai-plugin.json — AI plugin manifest
# - /.well-known/agent-skills/index.json — Agent Skills Discovery (v0.2.0)
# - /.well-known/oauth-protected-resource — RFC 9728
# - /llms.txt — llmstxt.org discovery file
# - /llm.txt — LLM-optimized API reference (markdown)
# - /api/openapi — OpenAPI 3.1 specification
# - /api-docs — HTML API reference
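The Content-Signal line in the robots.txt above is a comma-separated list of `key=yes|no` pairs. As a minimal sketch of how a crawler might read it, the function below extracts those pairs into a dictionary. The function name `parse_content_signal` and the shortened `ROBOTS` sample are illustrative assumptions, and the parsing covers only the simple form shown here, not the full Content Signals draft.

```python
def parse_content_signal(robots_txt: str) -> dict[str, bool]:
    """Return the Content-Signal preferences found in a robots.txt body.

    Hypothetical helper: handles only the `key=yes|no` form quoted in this
    report, not every variation the Content Signals draft may allow.
    """
    signals: dict[str, bool] = {}
    for raw_line in robots_txt.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments
        if not line.lower().startswith("content-signal:"):
            continue
        _, value = line.split(":", 1)
        for pair in value.split(","):
            key, sep, setting = pair.partition("=")
            if sep:  # skip fragments without an "="
                signals[key.strip().lower()] = setting.strip().lower() == "yes"
    return signals


# Shortened sample taken from the robots.txt quoted above.
ROBOTS = """\
User-agent: *
Allow: /
# Site-wide default.
Content-Signal: search=yes, ai-input=yes, ai-train=yes
"""

print(parse_content_signal(ROBOTS))
# {'search': True, 'ai-input': True, 'ai-train': True}
```

Note that the robots.txt comments say gallery pages opt out of training through noai/noimageai meta tags, so a site-wide `ai-train=yes` result would not describe those pages.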
Document
# PixelDojo
> AI image and video generation platform with a REST API.
## API
PixelDojo provides an async job-based API for programmatic access to AI image and video generation models.
- Base URL: https://pixeldojo.ai/api/v1
- Auth: Bearer token (API key)
- Endpoints: POST /models/{modelId}/run to submit a job, then GET /jobs/{jobId} to poll for the result
- Models available: 126
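
The base URL, bearer-token auth, and the two endpoints listed above are enough to sketch how the requests could be assembled with Python's standard library. This is a hedged illustration only: the helper names are hypothetical, and the JSON body (a single `prompt` field) is an assumption, since the request schema is not given here. Consult the OpenAPI spec for the real parameters.

```python
import json
import urllib.request

BASE_URL = "https://pixeldojo.ai/api/v1"  # from the API section above


def build_run_request(api_key: str, model_id: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) the POST /models/{modelId}/run request."""
    return urllib.request.Request(
        f"{BASE_URL}/models/{model_id}/run",
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def build_job_request(api_key: str, job_id: str) -> urllib.request.Request:
    """Build (but do not send) the GET /jobs/{jobId} request."""
    return urllib.request.Request(
        f"{BASE_URL}/jobs/{job_id}",
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},
    )


# "prompt" is an assumed field name, not confirmed by this document.
run_req = build_run_request("YOUR_API_KEY", "flux-1.1-pro", {"prompt": "a red fox"})
print(run_req.get_method(), run_req.full_url)
```

Passing either request to `urllib.request.urlopen` would send it; the sketch stops short of that so it can be inspected without a key or network access.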
### Available Models
- change-camera-angle: Camera-aware editing via fal.ai Qwen Image Edit 2511 with multi-angle LoRA — 360° orbit, tilt, and zoom. (1 credit(s))
- consistent-characters: Generate consistent character variations with FLUX Kontext, Nano Banana Pro/2, Flux 2 Dev, or Qwen Image 2 Pro. (1 credit(s))
- creative-upscale: Clarity Upscaler (creative upscale) via Replicate — boost detail with stable-diffusion refinement. (0.5 credit(s))
- dreamina: ByteDance Dreamina 3.1 — 4MP cinematic text-to-image with precise style control. (1 credit(s))
- ernie: Baidu Ernie text-to-image (fal.ai). Multilingual prompts and built-in prompt expansion. (1 credit(s))
- face-enhance: Crystal Upscaler via Replicate — face-detail preserving upscale, cost scales with output megapixels. (2 credit(s))
- flux: FLUX family on Replicate — Schnell, Dev, Pro, Kontext, Ultra, and LoRA remix variants in one entrypoint. (1 credit(s))
- flux-2-flex: Max-quality with up to 10 reference images (1.5 credit(s))
- flux-2-klein-4b: Very fast generation and editing with up to 5 reference images (0.1 credit(s))
- flux-2-klein-9b: 4-step distilled FLUX.2 [klein] foundation model for flexible control (0.5 credit(s))
- flux-2-pro: High-quality with up to 8 reference images (1.5 credit(s))
- flux-2-max: The highest fidelity image model from Black Forest Labs (2 credit(s))
- flux-2-dev: Fast quality with up to 4 reference images (1 credit(s))
- flux-2-lora: Dev model with custom LoRA support (1 credit(s))
- flux-edit: Black Forest Labs FLUX.1 Kontext for text-driven image editing — Dev (open-weight), Pro (state-of-the-art), and Max (premium typography). (1 credit(s))
- flux-dev: High-quality development model with configurable steps, guidance, and LoRA support. (1 credit(s))
- flux-krea-dev: Photorealistic generation that avoids the oversaturated AI look. LoRA compatible. (1 credit(s))
- flux-dev-multi-lora: Supports multiple custom LoRAs simultaneously for complex style combinations. (1 credit(s))
- flux-1.1-pro: Latest pro model with enhanced quality and strong prompt adherence. (1 credit(s))
- flux-1.1-pro-ultra: Highest quality Flux model with raw mode for natural-looking images. (1.5 credit(s))
- flux-kontext-pro: Advanced model with state-of-the-art performance for both generation and editing. (1 credit(s))
- flux-kontext-max: Premium model with maximum performance and improved typography for generation and editing. (2 credit(s))
- gemini-flash: Fast generation with Gemini 2.5 Flash (1 credit(s))
- nano-banana-pro: SOTA with accurate typography and reasoning (3 credit(s))
- nano-banana-2: Next-generation SOTA model with stronger consistency (3 credit(s))
- google-nano-banana: Google Nano Banana image editing — multi-image fusion + edit instruction with Standard/Pro/Pro-fal tiers and 1K/2K/4K resolution. (3 credit(s))
- gpt-image-low: Fast, lower detail generation (1 credit(s))
- gpt-image-medium: Balanced quality and speed (1 credit(s))
- gpt-image-high: Maximum detail and quality (4 credit(s))
- gpt-image-2: OpenAI GPT Image 2 via fal.ai — next-generation image model with 4K rendering and sharper text fidelity. (5 credit(s))
- grok-r2v: xAI Grok Imagine reference-to-video via Replicate — 1–7 reference images plus prompt for 1–10s clips at 480p or 720p. (10 credit(s))
- grok-video-extend: xAI Grok Imagine video extension — continue an existing MP4 with a prompt-directed extension (2–10s). (12 credit(s))
- hailuo-standard: Premium quality text-to-video and image-to-video (8 credit(s))
- hailuo-fast: Fast image-to-video generation (4 credit(s))
- heygen-avatar: Heygen Avatar 4 via fal.ai — animate a portrait with prompt-driven speech or an audio track, with optional background and captions. (120 credit(s))
- hidream-l1-fast: HiDream L1 Fast - Fast generation (1 credit(s))
- hidream-l1-dev: HiDream L1 Dev - Fast generation (1 credit(s))
- hidream-l1-full: HiDream L1 Full - Highest quality (2 credit(s))
- hidream-e1.1: HiDream E1.1 - Fast generation (1 credit(s))
- hunyuan-3d: Tencent Hunyuan 3D 3.1 — generate 3D meshes from a text prompt or a single image. (4 credit(s))
- ideogram-character: Generate consistent characters from a single reference image in many styles. (5 credit(s))
- image-editor: One-shot FLUX Kontext variants — filters, cartoonify, iconic locations, haircut swap, headshots, renaissance, face-to-many, and more. (1 credit(s))
- image-relighting: Relight images with Magic Lighting, Nano Banana Pro/2, or Qwen Image Edit — multi-provider routing with per-model credit rates. (1 credit(s))
- image-to-image-flux: FLUX Dev LoRA image-to-image on Replicate — prompt + source image + optional LoRA weights. (1 credit(s))
- imagineart: Imagineart 1.5 Pro image generation (fal.ai). (1.5 credit(s))
- kling-image: Kling Image V3 (fal.ai) — high-quality text-to-image with flexible aspect ratios. (1 credit(s))
- kling-image-edit: Kling Image V3 (fal.ai) image-to-image editing with a text instruction. (1 credit(s))
- kling-motion-control: Kling Video v3 Standard motion control endpoint (3 credits/sec)
- kling-motion-control-pro: Kling Video v3 Pro motion control endpoint (4 credits/sec)
- kling-reference-to-video: Kling O3 reference-driven video generation — image or video references, Standard or Pro tier. (15 credit(s))
- kling-v2-6: Kling Video v2.6 Pro (fal.ai) — text-to-video or image-to-video, 5 or 10 seconds, with audio generation. (15 credit(s))
- kling-video-v3-standard-text: Standard text-to-video with native audio (6 credits/sec)
- kling-video-v3-standard-image: Standard image-to-video with native audio (6 credits/sec)
- kling-video-v3-pro-text: Pro text-to-video with cinematic quality and native audio (8 credits/sec)
- kling-video-v3-pro-image: Pro image-to-video with cinematic quality and native audio (8 credits/sec)
- kling-video-edit: Kling O3 video-to-video edit — Standard or Pro, with optional reference images and audio preservation. (40 credit(s))
- lip-sync: Replicate sync/lipsync-2 — align mouth movements in a video to a separate audio track. (5 credit(s))
- ltx-2-fast-t2v: Fast text-to-video generation (6-20s, 1080p-2160p). (2 credits/sec)
- ltx-2-fast-i2v: Fast image-to-video generation (6-20s, 1080p-2160p). (2 credits/sec)
- ltx-2-pro-t2v: Higher quality text-to-video generation (6-10s, 1080p-2160p). (2 credits/sec)
- ltx-2-pro-i2v: Higher quality image-to-video generation (6-10s, 1080p-2160p). (2 credits/sec)
- ltx-2-pro-extend: Extend an existing video clip from the start or end (1-20s, Pro tier only). (2 credits/sec)
- magnific-upscaler: Freepik Magnific upscaler — creative or precision mode, up to 16x. (3 credit(s))
- omnihuman: ByteDance OmniHuman 1.5 via Replicate — audio-driven talking-head video with lip sync. (45 credit(s))
- openai-image-1: OpenAI GPT Image 1 Mini — text-to-image via Replicate. (1 credit(s))
- outpaint: fal.ai Image Apps V2 outpainting — expand an image beyond its original edges. (1 credit(s))
- p-image: Pruna P-Image — sub-second text-to-image with optional custom dimensions. (0.1 credit(s))
- p-image-edit: Pruna P-Image Edit — fast image editing with up to 5 reference images. (0.25 credit(s))
- p-video: Pruna P-Video — video generation with text/image/audio conditioning, draft mode, and 720p/1080p outputs. (2.5 credit(s))
- pixverse: Pixverse v5.6 video generation via Replicate — text-to-video or image-to-video with optional audio, at 360p–1080p. (7.5 credit(s))
- pixverse-v6: Pixverse V6 video generation via Runware. Text-to-video, image-to-video (start frame), or multi-clip (start + end frame). (10 credit(s))
- ponyxl-ponyrealism-v23: Pony Realism - Stylized anime generation (1 credit(s))
- ponyxl-tponynai3-v7: Pony NAI - Stylized anime generation (1 credit(s))
- ponyxl-waianinsfwponyxl-v140: Wai ANI - Stylized anime generation (1 credit(s))
- qwen-image-plus: Fast generation with excellent quality (1 credit(s))
- qwen-image-max: Highest quality output (2 credit(s))
- qwen-image-2.0: Fast, balanced image generation and editing (1 credit(s))
- qwen-image-2.0-pro: Enhanced text rendering, realistic textures, and semantic adherence (2 credit(s))
- recraft-v4: Recraft's latest image model. Strong prompt accuracy, art-directed composition, integrated text rendering. Fast and cost-efficient at standard resolution. (1 credit(s))
- recraft-v4-pro: Recraft V4 at ~2048px resolution. Same design taste and prompt accuracy as V4, with higher resolution for print-ready and large-scale work. (6 credit(s))
- recraft-v4-svg: Production-ready SVG vector images from text. Recraft V4's design taste applied to vector output — clean geometry, structured layers, editable paths. (2 credit(s))
- recraft-v4-pro-svg: Detailed SVG vector graphics from text. Recraft V4 Pro's design taste with more geometric detail and finer paths — clean layers, editable output, scalable to any size. (8 credit(s))
- redux-flux: Black Forest Labs Flux Redux image variations — feed a source image, get stylistic riffs. (1 credit(s))
- runway-gen4-video: Runway Gen-4.5 video generation — text-to-video or image-to-video, 5 or 10 seconds. (15 credit(s))
- runway-video: Canonical version-agnostic Runway video API ID. (15 credit(s))
- runway-gen4: Legacy alias for clients pinned to runway-gen4; maps to the current Runway model. (15 credit(s))
- seedance-1.5: ByteDance Seedance 1 video generation — text-to-video or image-to-video with optional end frame. (8 credit(s))
- seedance-2-high: Higher-quality Seedance 2.0 video generation (supports 1080p) (4 credits/sec)
- seedance-2-reference: Seedance 2.0 multimodal reference-to-video. Combine up to 9 images, 3 video clips, and 3 audio tracks to guide characters, motion, and sound. (20 credit(s))
- seedance-video-edit: Edit source videos with Seedance 2.0 using prompted changes, optional reference images, and 480p, 720p, or 1080p output. (25 credit(s))
- seedream-3: ByteDance Seedream 3 text-to-image via Replicate. (1 credit(s))
- seedream-4: ByteDance Seedream 4.5 — new-generation image creation with superior aesthetics, text rendering, and up to 4K resolution. (1 credit(s))
- seedream-5-lite: ByteDance Seedream 5.0 Lite — fast, high-quality image generation and editing with strong aesthetics and text rendering. (1 credit(s))
- text-to-music: ElevenLabs Music via Replicate — generate music from a text prompt. (2 credit(s))
- text-to-speech: MiniMax Speech 2.8 Turbo via Replicate — convert text into natural-sounding speech. (0.1 credit(s))
- veo-3.1-fast: Faster generation at 3 credits per second (3 credits/sec)
- veo-3.1-standard: Higher quality at 8 credits per second (8 credits/sec)
- veo-3.1-lite: Runware-powered Lite variant at 1.5 credits/sec for 720p and 2 credits/sec for 1080p. No reference images, no audio generation, no 1:1 aspect ratio. (1.5 credits/sec)
- video-autocaption: TikTok-style auto-captioning via Replicate. (5 credit(s))
- video-reframe: Luma Reframe Video via Replicate — change a video's aspect ratio intelligently. (8 credit(s))
- video-to-sound: ThinkSound via Replicate — generate a sound effect track from a video. (2 credit(s))
- video-transform: Runway Gen4 Aleph via Replicate — transform the first 5 seconds of a video with a prompt. (20 credit(s))
- video-upscaler: Topaz Labs Video Upscale via Replicate — upscale video resolution and FPS. (10 credit(s))
- wan-2.2-standard: Premium quality with enhanced detail (3 credit(s))
- wan-2.2-plus: Official Alibaba model with 1080p support (10 credit(s))
- wan-2.2-extended: fal.ai WAN 2.2 with up to 10-second videos and dual LoRA support (1.2 credits/sec)
- wan-2.2-animate: WAN 2.2 video animation — drive a character image with a motion reference video. (10 credit(s))
- wan-2.2-replace: WAN 2.2 character replacement — swap a character in a source video while preserving scene and motion. (10 credit(s))
- wan-2.6-standard: Higher quality, 720p/1080p support (2.5 credits/sec)
- wan-2.6-flash: Fast and affordable image-to-video (1 credit/sec)
- wan-2.6-image: Alibaba WAN 2.6 text-to-image with prompt enhancement and multi-image output. (1 credit(s))
- wan-2.6-image-edit: Alibaba WAN 2.6 image editing — up to 4 reference images. (1 credit(s))
- wan-2.7-image: Faster Wan 2.7 image generation and editing (1 credit(s))
- wan-2.7-image-pro: Higher quality Wan 2.7 tier with 4K support for text-to-image (2 credit(s))
- wan-2.7-image-edit: Alibaba WAN 2.7 image editing — Standard and Pro tiers, supports 1-4 input images for fusion edits. (1 credit(s))
- wan-2.7-t2v: Text-to-video with audio sync, 720p/1080p output, and 2-15 second durations (2.5 credits/sec)
- wan-2.7-i2v: Image-to-video and video continuation with optional last-frame control and audio sync (2.5 credits/sec)
- wan-image: Fast cinematic image generation (3-6 seconds) with up to 4MP output and optional LoRA support. (1 credit(s))
- wan-reference-to-video: Alibaba WAN reference-to-video — up to 5 image/video references with multi-shot support. (4 credit(s))
- wan-video-character-swap: Alibaba WAN character swap — combine a character image with a reference video to produce a new clip. (20 credit(s))
- wan-video-edit: Alibaba WAN 2.7 video editing — modify an existing clip via prompt with optional reference images. (6 credit(s))
- xai-image: xAI Grok Imagine — sync image generation with fast results and natural aesthetics. (1 credit(s))
- xai-image-edit: xAI Grok image editing — sync response (no polling). Provide an image URL and a text edit instruction. (1 credit(s))
- xai-video: xAI Grok Imagine video — text-to-video or image-to-video, 1-15 seconds at 480p or 720p. (10 credit(s))
- xai-video-edit: xAI Grok Imagine Video edit — transform short clips via Replicate. (15 credit(s))
- z-image-turbo: Super-fast 6B parameter text-to-image with great text rendering and LoRA support. (0.5 credit(s))
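
Each entry in the list above ends with either a flat price, `(N credit(s))`, or a per-second price, `(N credits/sec)`. A rough sketch of reading that suffix and estimating a job's cost follows. It assumes that per-second models are billed as rate times clip duration, which this document does not state explicitly, and the function names are hypothetical. Some entries also mention per-model or per-resolution rates, so treat the result as an estimate of the listed base rate only.

```python
import re

# Matches the trailing "(1 credit(s))", "(8 credits/sec)" or "(1 credit/sec)".
PRICE_RE = re.compile(r"\((\d+(?:\.\d+)?) credit(\(s\)|s?/sec)\)\s*$")


def parse_price(entry: str) -> tuple[float, bool]:
    """Return (rate, is_per_second) parsed from one model list entry."""
    match = PRICE_RE.search(entry)
    if match is None:
        raise ValueError(f"no price suffix found in: {entry!r}")
    return float(match.group(1)), match.group(2).endswith("/sec")


def estimate_credits(entry: str, seconds: float = 0.0) -> float:
    """Estimate credits for one job.

    Assumption: per-second prices multiply by the clip duration.
    """
    rate, per_second = parse_price(entry)
    return rate * seconds if per_second else rate


video = ("- kling-video-v3-pro-text: Pro text-to-video with cinematic "
         "quality and native audio (8 credits/sec)")
image = ("- flux-2-klein-4b: Very fast generation and editing with up to "
         "5 reference images (0.1 credit(s))")

print(estimate_credits(video, seconds=5))  # 40.0
print(estimate_credits(image))             # 0.1
```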
## Documentation
- [Full API Docs (LLM-optimized)](https://pixeldojo.ai/llm.txt): Complete API reference in plain text
- [OpenAPI Spec](https://pixeldojo.ai/api/openapi): Machine-readable OpenAPI 3.1 specification
- [API Docs (HTML)](https://pixeldojo.ai/api-docs): Static HTML API reference
- [API Platform](https://pixeldojo.ai/api-platform): Interactive dashboard for keys, usage, and docs
- [AI Plugin Manifest](https://pixeldojo.ai/.well-known/ai-plugin.json): Agent plugin manifest
## Quick Start
1. Get an API key: https://pixeldojo.ai/api-platform/api-keys
2. Submit a job: POST https://pixeldojo.ai/api/v1/models/flux-1.1-pro/run
3. Poll for results: GET https://pixeldojo.ai/api/v1/jobs/{jobId}
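
Step 3 implies a polling loop. A minimal sketch is shown below, with the HTTP call injected as a function so the loop can be exercised without network access. The `status` field and the terminal values `"completed"` and `"failed"` are assumptions made for illustration; the actual job response schema should be taken from the OpenAPI spec.

```python
import time
from typing import Callable

# Assumed terminal states; not confirmed by this document.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_job(
    fetch_job: Callable[[str], dict],
    job_id: str,
    *,
    interval: float = 2.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch_job(job_id) until the job reaches a terminal status.

    fetch_job is any callable that performs GET /jobs/{jobId} and returns
    the decoded JSON body as a dict (hypothetical shape).
    """
    deadline = time.monotonic() + timeout
    while True:
        job = fetch_job(job_id)
        if job.get("status") in TERMINAL_STATUSES:
            return job
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} did not finish within {timeout}s")
        sleep(interval)  # wait before the next poll


# Usage with a stand-in for the real HTTP call.
_fake_responses = iter([
    {"status": "processing"},
    {"status": "completed", "output": "https://example.com/result.png"},
])
result = wait_for_job(lambda _job_id: next(_fake_responses), "job-123",
                      sleep=lambda _seconds: None)
print(result["status"])  # completed
```

Injecting `fetch_job` and `sleep` keeps the loop independent of any particular HTTP client and makes the timeout path easy to test.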
## Links
- Website: https://pixeldojo.ai
- API Platform: https://pixeldojo.ai/api-platform
- Documentation: https://pixeldojo.ai/api-platform/documentation
- ComfyUI Plugin: https://github.com/blovett80/ComfyUI-PixelDojo
Document
Not stored for this site.