Explore 150+ Vision API Endpoints

Build Vision Apps Faster with APIs, SDKs, and Tools

PixLab helps teams ship document AI, identity verification, media processing, and creative workflows from one platform. Use ready-to-deploy APIs, production tools, and mobile apps to move from proof of concept to release with less integration overhead.

Trusted by Product Teams and Developers Worldwide

Creative Zone - PixLab client
Media Hub - PixLab Client
Hub People - PixLab client
Symisc - PixLab Client
GVIC - PixLab client
1.5B+
Monthly Requests Served
Monthly API traffic ↗ processed globally
99.9%
Uptime Reliability
Built on enterprise-grade infrastructure
150+
Active API Endpoints
API Endpoints expanding continuously
Unified API Platform

One Platform for Vision, Document, and Identity APIs

Build with a unified API stack for OCR, document intelligence, content moderation, ID workflows, and media transformation. Test quickly, manage keys centrally, and deploy production features without stitching together disconnected services.

Facial Authentication icon

FACEIO Facial Authentication

Enable passwordless sign-in and identity verification on web and mobile with FACEIO, powered by secure, privacy-first facial recognition.

Start with FACEIO
ID Scan & Extract API icon

DOCSCAN ID & Document Extraction

Parse passports, national IDs, and driver's licenses from 197+ countries into normalized JSON. Supports 11,000+ document types with lightweight integration.

Explore ID Scan API
Vision Workspace + VLM APIs icon

Vision Workspace + VLM APIs

Extract structured data, answer document questions, and automate image understanding with vision-language workflows powered by leading VLMs.

Explore Vision Platform
Creative Tools icon

Creative and Media Tooling

Build faster with AI Photo Editor, AI Video Editor, Annotate, Creative Toolbox, Convert Box, and Bulk Image Background Removal in production-ready browser workflows.

Explore Creative Tools
ID Verification & KYC

DOCSCAN API for Global Identity and Document Verification

Extract structured JSON from 11,000+ identity document types, including passports, driver's licenses, national IDs, and supporting documents across 197+ countries and territories. DOCSCAN is built for reliable onboarding, KYC, and back-office verification workflows.

  • Document Identification Instantly recognize and classify passports, national IDs, and driver's licenses before data extraction begins.
  • Precise Data Extraction Extract full name, ID number, date of birth, address, MRZ code, and more from any supported document in real time.
  • Privacy-First Processing All processing runs strictly in-memory. Zero persistent storage, zero retained images - enforced by default.
docscan.py
# ID Scan and Extract - Python code sample
# Endpoint: https://pixlab.io/endpoints/docscan

import requests

req = requests.get('https://api.pixlab.io/docscan', params={
  'img':  'https://example.com/passport.jpg',
  'type': 'passport',  # 'idcard' | 'passport' | 'driverslicense'
  'key':  'PIXLAB_API_KEY', # Get yours at console.pixlab.io
})

reply = req.json()

if reply['status'] != 200:
    print(reply['error'])
else:
    # Parsed fields returned from the document
    fields = reply['fields']
    print("Issuing Country: "  + fields.get('issuingCountry',  'N/A'))
    print("Full Name:        "  + fields.get('fullName',       'N/A'))
    print("Document Number:  "  + fields.get('documentNumber',  'N/A'))
    print("Date of Birth:    "  + fields.get('dateOfBirth',    'N/A'))
    print("Address:          "  + fields.get('address',        'N/A'))
    print("MRZ Code:         "  + fields.get('mrz',            'N/A'))
REST · No SDK required · JSON response Full reference →

MRZ + Non-MRZ Coverage

Process passports, ID cards, and licenses with or without MRZ using one consistent DOCSCAN workflow.

Normalized JSON Output

Return standardized fields like full name, document number, dates, and address for direct backend and KYC pipeline use.

Face Crop + Quality Signals

Use extracted face crops and image quality indicators to improve verification confidence and reduce manual review time.

Image Processing & Media APIs

Essential Vision Tools & APIs for Businesses & Developers

Ship production image workflows with focused REST endpoints. Clean overlays and watermarks, remove backgrounds, and localize embedded image text while preserving visual quality.

POST /txtremove

Text & Watermark Remove API

Remove text overlays, captions, and watermark artifacts using inpainting-based reconstruction. Output remains natural for downstream editing, publishing, or model ingestion workflows.

  • Automated Inpainting Reconstructs removed regions so cleaned assets can be reused in professional creative and content operations.
  • Simple POST Integration Upload local files as multipart form data and get clean output in the same JSON response pattern used by other PixLab endpoints.
  • Composable with Other APIs Chain cleaned output into OCR, translation, and document extraction flows for end-to-end automation.
txtremove.py
# Remove text/watermark from a local image
import base64, requests

with open('./watermarked.jpg', 'rb') as image_file:
    req = requests.post(
        'https://api.pixlab.io/txtremove',
        files={'file': image_file},
        data={'key': 'PIXLAB_API_KEY'}
    )

reply = req.json()
if reply['status'] != 200:
    print(reply.get('error', 'Request failed'))
else:
    with open('cleaned.png', 'wb') as f:
        f.write(base64.b64decode(reply['imgData']))
    print('Saved cleaned.png')
WATERMARKED
Image with text watermark before cleanup
CLEANED
Image after text and watermark removal
bgremove.py
# Background removal with a public image URL
import base64, requests

reply = requests.get(
    'https://api.pixlab.io/bgremove',
    params={
        'img': 'https://example.com/product.jpg',
        'key': 'PIXLAB_API_KEY'
    }
).json()

if reply['status'] != 200:
    print(reply.get('error', 'Request failed'))
else:
    img_data = reply['imgData']
    with open('bg-removed.png', 'wb') as f:
        f.write(base64.b64decode(img_data))
    print('Saved bg-removed.png')
INPUT
Input image for background removal API
OUTPUT
Background removed output example
GET /bgremove

Background Removal API

Isolate the main subject and remove the background with pixel-accurate segmentation. Ideal for product catalogs, ad creatives, profile photos, and automated media pipelines.

  • High-Precision Subject Segmentation Automatically detects foreground objects without manual masking, making output reliable across varied scenes.
  • Flexible Output Delivery Receive image data as base64 by default, with optional direct file link output when object storage integration is enabled.
  • Scalable Batch Workflows Pair with Bulk Image Background Removal for high-volume processing.
POST /imgtranslate

Image Text Translation API

Detect, translate, and redraw text directly inside the same image while preserving the original composition. Useful for localizing product banners, app screenshots, signage, and marketing creatives.

  • Language-Aware Translation Use src_lang=auto for source detection and set the output language with dst_lang.
  • Layout Preservation Text is replaced inline to keep the image readable and visually consistent with the original design.
  • Developer-Friendly Output Returns a standard JSON response with status and translated image payload for direct integration into existing pipelines.
imgtranslate.py
# Inline image text translation (multipart POST)
import base64, requests

with open('./local_image.png', 'rb') as image_file:
    req = requests.post(
        'https://api.pixlab.io/imgtranslate',
        files={'file': image_file},
        data={'src_lang': 'auto', 'dst_lang': 'fr', 'key': 'PIXLAB_API_KEY'}
    )

reply = req.json()
if reply['status'] != 200:
    print(reply.get('error', 'Request failed'))
else:
    with open('translated.png', 'wb') as f:
        f.write(base64.b64decode(reply['imgData']))
    print('Saved translated.png')
SOURCE (ENGLISH)
Source image with embedded text before translation
TRANSLATED (FRENCH)
Translated image output from image text translation API
Use Cases & Workflows

Bring Vision to Your Apps

These examples reflect common production patterns across fintech, commerce, logistics, operations, and creator tooling.

ID verification and KYC workflow using the DOCSCAN API
ID Scan API

ID Verification & KYC

Extract structured data from passports, visas, and national IDs to verify user identity and automate onboarding workflows.

Open DOCSCAN Reference
Vision Platform document Q&A - asking questions over contracts and invoices
Vision LLM API

Document Q&A

Ask questions over contracts, invoices, forms, and reports - and return structured answers using vision-language models.

Explore Vision LLMs
Content moderation - NSFW detection and automatic blur applied to unsafe images
Content Moderation API

Content Moderation

Score image uploads for adult or graphic content using the NSFW endpoint, then automatically blur or reject unsafe assets.

Try NSFW Endpoint
Face detection and privacy blur applied to human faces in an image
Face Detection API

Detect & Blur Faces

Detect all human faces in an image or video frame, then apply a privacy blur - using just two PixLab API calls.

Try FACEDETECT
Background removal - subject isolated from image background using deep learning
Background Remove API

Background Removal

Remove backgrounds from product shots, portraits, and creative assets with a single API call - no manual masking needed.

Try BG-REMOVE
Convert Box web interface for media and document format conversion in the browser
Media Conversion

Media & Document Conversion

Convert between 239+ image, video, audio, and document formats in-browser or via API - no installs, no server setup, batch-ready.

Open Convert Box
Developer Workflow

Developer Experience from Sandbox to Production

Start with tested examples, validate payloads in the workspace, and move into production with endpoint-level control and predictable outputs.

Generate Your First API Key
Authentication, Identity & Documents

Passwordless Auth, Global KYC, and Programmatic Document Generation

Combine FACEIO for passwordless sign-in, DOCSCAN for global identity document extraction, and PDF APIs for reports, certificates, and automated outputs.

Authentication Framework

FACEIO: Passwordless Facial Authentication

Add face-based login to any web application in minutes. No passwords or OTPs, just a camera and a few lines of JavaScript, with liveness detection and anti-spoofing.

Integrate FACEIO
FACEIO passwordless facial authentication flow for secure sign-in

Intelligent Workflows

Combine PixLab APIs to build complete document workflows, from identity verification to automated reporting, without stitching together multiple vendors.

  • eKYC Onboarding Flow Scan ID documents with DOCSCAN then confirm liveness with FACEIO ↗ for full KYC in a single flow.
  • Invoice & Contract Auditing Ask questions over uploaded PDFs, contracts, and invoices using the Vision LLM API to extract key clauses and figures instantly.
  • Automated Report Generation Pipe OCR-extracted data directly into the PDF Generation API to produce branded compliance reports and summaries programmatically.

Rich PDF Generation for Operational Documents

Generate pixel-perfect PDFs from HTML templates, Markdown, or structured JSON data. Build invoices, compliance reports, certificates, and branded documents - entirely via API with no design tools needed.

  • HTML/CSS + Markdown → PDF in a single REST call
  • Custom headers, footers, watermarks, and multi-page layouts
  • PDF ↔ image conversion via pdftoimg endpoint
Document Intelligence Workflow
Vision Platform

Build, Test, and Ship Faster with PixLab Vision Platform

Prototype in the workspace, validate with real payloads, and deploy through endpoint APIs with consistent behavior across environments.

Upcoming: Think-Act Desktop Agent

Think-Act is an upcoming desktop agent powered by PixLab vision APIs. It understands on-screen context, reasons through multi-step tasks, and executes actions with human-in-the-loop control.

  • Understands on-screen context across apps, tabs, and workflows.
  • Plans multi-step tasks with reasoning before taking action.
  • Executes clicks, typing, and navigation with human-in-the-loop control.
Think-Act desktop agent concept - AI reasoning over screen context with human approval
PixLab Vision Platform workspace - testing OCR and VLM workflows with real document payloads

Vision Workspace and API Docs in One Loop

Test OCR and VLM workflows with real payloads, then move directly into implementation with endpoint docs and examples.

  • Upload real files and test OCR, VLM parsing, and document Q&A in one workspace.
  • Validate extracted fields and structured outputs before writing production integration code.
  • Move directly from workspace testing to endpoint docs, request examples, and implementation references.
Open Vision Workspace
Enterprise

On-Premises and Enterprise Deployment

Run the full PixLab API stack behind your own firewall. Built for regulated industries requiring air-gapped environments, custom SLAs, and dedicated infrastructure.

  • Air-gapped & private cloud deployment
  • Covers ID Scan, OCR, Vision LLM & Media APIs
  • GDPR, HIPAA & financial compliance ready
Explore On-Premises
Mobile Apps Hub

PixLab Mobile Apps Hub Built on the Same APIs You Use

Explore PixLab iOS apps for video editing, photo editing, OCR scanning, and live translation in one Mobile Hub. Each app is powered by the same APIs available to developers.

AI Photo Editor

Full-featured photo editor for iOS - AI background removal, filters, layers, and creative effects on the go.

View iOS Product Page
AI Photo Editor - iOS app interface showing creative editing tools

World Lens

Point and translate - live on-screen translation of text in the real world, powered by PixLab OCR and translation APIs.

Four apps, one hub, one API foundation.

These apps are built on the exact same REST APIs available to every developer. Explore our mobile hub, or start using our APIs to build your own.

Open Source SDKs and Native Libraries

Build Locally with Native C and C++ Libraries

Run vision, inference, and data workflows in embedded, desktop, or edge environments when local execution and runtime control matter.

SOD

Embedded computer vision and machine learning library in C for real-time object detection and face recognition in CPU-first environments.

SyNumPy

C++ NumPy file reader/writer - read and write .npy and .npz array files from C++ without Python, for ML pipelines.

Tiny Dream

Stable Diffusion inference in pure C/C++ - generate images from text prompts on CPU without Python or PyTorch.

ART

Real-Time ASCII Art C Library - transform images and video frames into ASCII art at runtime with configurable density.

Explore PixLab's open source libraries and start building locally.

Open Downloads
Platform Overview

One Platform for Document, Identity & Media Workflows

From OCR and identity verification to media transformation, PDF generation, and mobile apps, PixLab provides the building blocks for end-to-end vision products.

Vision Platform and OCR

Chat with documents, extract structured data from any layout, and build RAG pipelines using the Vision Platform and VLM API.

FACEIO and Identity Stack

Passwordless facial login with FACEIO and full KYC onboarding via DOCSCAN - 11,000+ document types, 197+ countries.

Rich PDF Generation

Convert HTML, Markdown, or JSON into production-ready PDFs. Power invoices, reports, and certificates from one REST endpoint - no design tools needed.

On-Premises & Enterprise

Run PixLab APIs behind your own firewall with full on-premises deployment. Air-gapped environments, custom SLAs, and dedicated infrastructure for regulated industries.

Ready to Build?

Launch Vision Features Without Rebuilding Your Stack

Use PixLab to build, test, and deploy document AI, identity, OCR, and media workflows with fewer moving parts and faster delivery cycles.