Essential Vision Tools & APIs
for Businesses & Developers

Access over 150+ API endpoints → at your fingertips. PixLab brings together ID document scanning, agentic desktop automation, document intelligence for businesses, background removal, watermark cleanup, inline image translation for developers & marketing teams, and creative tools including AI video editing for creators in one API stack.

Open PixLab Console Start with PixLab

Quick Access

API Portal API Endpoints ID Scan & Extract Vision Platform Mobile Apps

Trusted by Product Teams and Developers Worldwide

1.5B+
Monthly Requests Served: Monthly API traffic ↗ processed globally; 99.9%
Uptime Reliability: Built on enterprise-grade infrastructure; 150+
Active API Endpoints: API Endpoints expanding continuously

Unified API Platform

One Platform for
Vision, Document, and Identity APIs

Build with a unified API stack for agentic desktop automation, document intelligence, content moderation, ID workflows, and media transformation. Test quickly, manage keys centrally, and deploy production features without stitching together disconnected services.

Open API Portal Browse Endpoints

FACEIO Facial Authentication

Enable passwordless sign-in and identity verification on web and mobile with FACEIO, powered by secure, privacy-first facial recognition.

Start with FACEIO

DOCSCAN ID & Document Extraction

Parse passports, national IDs, and driver's licenses from 197+ countries into normalized JSON. Supports 11,000+ document types with lightweight integration.

Explore ID Scan API

Vision Workspace + VLM APIs

Extract structured data, answer document questions, and automate image understanding with vision-language workflows powered by leading Vision Language Models.

Explore Vision Platform

Creative and Media Tooling

Build faster with AI Photo Editor, AI Video Editor, Annotate, Creative Toolbox, Convert Box, and Bulk Image Background Removal in production-ready browser workflows.

Explore Creative Tools

ID Verification & KYC

DOCSCAN API for
Global Identity and Document Verification

Extract structured JSON from 11,000+ identity document types, including passports, driver's licenses, national IDs, and supporting documents across 197+ countries and territories. DOCSCAN is built for reliable onboarding, KYC, and back-office verification workflows.

Document Identification Instantly recognize and classify passports, national IDs, and driver's licenses before data extraction begins.
Precise Data Extraction Extract full name, ID number, date of birth, address, MRZ code, and more from any supported document in real time.
Privacy-First Processing All processing runs strictly in-memory. Zero persistent storage, zero retained images - enforced by default.

Read DOCSCAN Docs Create API Key

docscan.py

Python

# ID Scan and Extract - Python code sample
# Endpoint: https://pixlab.io/endpoints/docscan

import requests

req = requests.get('https://api.pixlab.io/docscan', params={
  'img':  'https://example.com/passport.jpg',
  'type': 'passport',  # 'idcard' | 'passport' | 'driverslicense'
  'key':  'PIXLAB_API_KEY', # Get yours at console.pixlab.io
})

reply = req.json()

if reply['status'] != 200:
    print(reply['error'])
else:
    # Parsed fields returned from the document
    fields = reply['fields']
    print("Issuing Country: "  + fields.get('issuingCountry',  'N/A'))
    print("Full Name:        "  + fields.get('fullName',       'N/A'))
    print("Document Number:  "  + fields.get('documentNumber',  'N/A'))
    print("Date of Birth:    "  + fields.get('dateOfBirth',    'N/A'))
    print("Address:          "  + fields.get('address',        'N/A'))
    print("MRZ Code:         "  + fields.get('mrz',            'N/A'))

REST · No SDK required · JSON response Full reference →

MRZ + Non-MRZ Coverage

Process passports, ID cards, and licenses with or without MRZ using one consistent DOCSCAN workflow.

Normalized JSON Output

Return standardized fields like full name, document number, dates, and address for direct backend and KYC pipeline use.

Face Crop + Quality Signals

Use extracted face crops and image quality indicators to improve verification confidence and reduce manual review time.

Image Processing & Media APIs

Text, Watermark & Background Removal
and Image Translation APIs

Ship production image workflows with focused REST endpoints. Clean overlays and watermarks, remove backgrounds, and localize embedded image text while preserving visual quality.

POST /txtremove

Text & Watermark Remove API

Remove text overlays, captions, and watermark artifacts using inpainting-based reconstruction. Output remains natural for downstream editing, publishing, or model ingestion workflows.

Automated Inpainting Reconstructs removed regions so cleaned assets can be reused in professional creative and content operations.
Simple POST Integration Upload local files as multipart form data and get clean output in the same JSON response pattern used by other PixLab endpoints.
Composable with Other APIs Chain cleaned output into OCR, translation, and document extraction flows for end-to-end automation.

View TXT-REMOVE Docs Create API Key

txtremove.py

Python

# Remove text/watermark from a local image
import base64, requests

with open('./watermarked.jpg', 'rb') as image_file:
    req = requests.post(
        'https://api.pixlab.io/txtremove',
        files={'file': image_file},
        data={'key': 'PIXLAB_API_KEY'}
    )

reply = req.json()
if reply['status'] != 200:
    print(reply.get('error', 'Request failed'))
else:
    with open('cleaned.png', 'wb') as f:
        f.write(base64.b64decode(reply['imgData']))
    print('Saved cleaned.png')

Image with text watermark before cleanup — WATERMARKED

Image after text and watermark removal — CLEANED

bgremove.py

Python

# Background removal with a public image URL
import base64, requests

reply = requests.get(
    'https://api.pixlab.io/bgremove',
    params={
        'img': 'https://example.com/product.jpg',
        'key': 'PIXLAB_API_KEY'
    }
).json()

if reply['status'] != 200:
    print(reply.get('error', 'Request failed'))
else:
    img_data = reply['imgData']
    with open('bg-removed.png', 'wb') as f:
        f.write(base64.b64decode(img_data))
    print('Saved bg-removed.png')

Input image for background removal API — INPUT

Background removed output example — OUTPUT

GET /bgremove

Background Removal API

Isolate the main subject and remove the background with pixel-accurate segmentation. Ideal for product catalogs, ad creatives, profile photos, and automated media pipelines.

High-Precision Subject Segmentation Automatically detects foreground objects without manual masking, making output reliable across varied scenes.
Flexible Output Delivery Receive image data as base64 by default, with optional direct file link output when object storage integration is enabled.
Scalable Batch Workflows Pair with Bulk Image Background Removal for high-volume processing.

View BG-REMOVE Docs Open Bulk Tool

POST /imgtranslate

Image Text Translation API

Detect, translate, and redraw text directly inside the same image while preserving the original composition. Useful for localizing product banners, app screenshots, signage, and marketing creatives.

Language-Aware Translation Use src_lang=auto for source detection and set the output language with dst_lang.
Layout Preservation Text is replaced inline to keep the image readable and visually consistent with the original design.
Developer-Friendly Output Returns a standard JSON response with status and translated image payload for direct integration into existing pipelines.

View IMG-TRANSLATE Docs Get API Key

imgtranslate.py

Python

# Inline image text translation (multipart POST)
import base64, requests

with open('./local_image.png', 'rb') as image_file:
    req = requests.post(
        'https://api.pixlab.io/imgtranslate',
        files={'file': image_file},
        data={'src_lang': 'auto', 'dst_lang': 'fr', 'key': 'PIXLAB_API_KEY'}
    )

reply = req.json()
if reply['status'] != 200:
    print(reply.get('error', 'Request failed'))
else:
    with open('translated.png', 'wb') as f:
        f.write(base64.b64decode(reply['imgData']))
    print('Saved translated.png')

Source image with embedded text before translation — SOURCE (ENGLISH)

Translated image output from image text translation API — TRANSLATED (FRENCH)

Developer Workflow

Developer Experience
from Sandbox to Production

Start with tested examples, validate payloads in the workspace, and move into production with endpoint-level control and predictable outputs.

Generate Your First API Key

AI Video Editor interface with timeline, clips, and browser-based editing controls

Use Cases & Workflows

Scalable & Unified APIs at Your Fingertips!

These examples reflect common production patterns across fintech, commerce, logistics, operations, and creator tooling. Use production-ready APIs, browser tools, and mobile apps to move from proof of concept to release with less integration overhead.

ID verification and KYC workflow using the DOCSCAN API

ID Scan API

ID Verification & KYC

Extract structured data from passports, visas, and national IDs to verify user identity and automate onboarding workflows.

Open DOCSCAN Reference

Vision LLM API

Document Q&A

Ask questions over contracts, invoices, forms, and reports - and return structured answers using vision-language models.

Explore Vision LLMs

Content Moderation API

Content Moderation

Score image uploads for adult or graphic content using the NSFW endpoint, then automatically blur or reject unsafe assets.

Try NSFW Endpoint

Face detection and privacy blur applied to human faces in an image

Face Detection API

Detect & Blur Faces

Detect all human faces in an image or video frame, then apply a privacy blur - using just two PixLab API calls.

Try FACEDETECT

Background Remove API

Background Removal

Remove backgrounds from product shots, portraits, and creative assets with a single API call - no manual masking needed.

Try BG-REMOVE

Convert Box web interface for media and document format conversion in the browser

Media Conversion

Media & Document Conversion

Convert between 239+ image, video, audio, and document formats in-browser or via API - no installs, no server setup, batch-ready.

Open Convert Box

Explore All API Use Cases

Authentication, Identity & Documents

Passwordless Auth, Global KYC,
and Programmatic Document Generation

Combine FACEIO for passwordless sign-in, DOCSCAN for global identity document extraction, and PDF APIs for reports, certificates, and automated outputs.

Authentication Framework

FACEIO: Passwordless Facial Authentication

Add face-based login to any web application in minutes. No passwords or OTPs, just a camera and a few lines of JavaScript, with liveness detection and anti-spoofing.

Integrate FACEIO

FACEIO passwordless facial authentication flow for secure sign-in

Intelligent Workflows

Combine PixLab APIs to build complete document workflows, from identity verification to automated reporting, without stitching together multiple vendors.

eKYC Onboarding Flow Scan ID documents with DOCSCAN then confirm liveness with FACEIO ↗ for full KYC in a single flow.
Invoice & Contract Auditing Ask questions over uploaded PDFs, contracts, and invoices using the Vision LLM API to extract key clauses and figures instantly.
Automated Report Generation Pipe OCR-extracted data directly into the PDF Generation API to produce branded compliance reports and summaries programmatically.

Vision LLM API View All Endpoints

Rich PDF Generation for Operational Documents

Generate pixel-perfect PDFs from HTML templates, Markdown, or structured JSON data. Build invoices, compliance reports, certificates, and branded documents - entirely via API with no design tools needed.

HTML/CSS + Markdown → PDF in a single REST call
Custom headers, footers, watermarks, and multi-page layouts
PDF ↔ image conversion via pdftoimg endpoint

Explore PDF APIs View pdfgen Reference

Vision Platform

Build, Test, and Ship Faster with
PixLab Vision Platform

Prototype in the workspace, validate with real payloads, and deploy through endpoint APIs with consistent behavior across environments.

Upcoming: Think-Act Desktop Agent

Think-Act is an upcoming desktop agent powered by PixLab vision APIs. It understands on-screen context, reasons through multi-step tasks, and executes actions with human-in-the-loop control.

Understands on-screen context across apps, tabs, and workflows.
Plans multi-step tasks with reasoning before taking action.
Executes clicks, typing, and navigation with human-in-the-loop control.

Think-Act desktop agent concept - AI reasoning over screen context with human approval

PixLab Vision Platform workspace - testing OCR and VLM workflows with real document payloads

Vision Workspace and API Docs in One Loop

Test OCR and VLM workflows with real payloads, then move directly into implementation with endpoint docs and examples.

Upload real files and test OCR, VLM parsing, and document Q&A in one workspace.
Validate extracted fields and structured outputs before writing production integration code.
Move directly from workspace testing to endpoint docs, request examples, and implementation references.

Open Vision Workspace

Enterprise

On-Premises and Enterprise Deployment

Run the full PixLab API stack behind your own firewall. Built for regulated industries requiring air-gapped environments, custom SLAs, and dedicated infrastructure.

Air-gapped & private cloud deployment
Covers ID Scan, OCR, Vision LLM & Media APIs
GDPR, HIPAA & financial compliance ready

Explore On-Premises

Additional Browser-Based Tools

Browser-native tools for UI code generation, design mockups, image annotation, and media conversion.

APP UX SwiftUI & Flutter code gen

Convert Box 239+ media format converter

Annotate Image labeling for ML/CV datasets

Mockup Editor Polish screenshots & mockups

Mind Map Online mind mapping tool

Tilemap Editor 2D game tile map design tool

Mobile Apps Hub

PixLab Mobile Apps Hub
Built on the Same APIs You Use

Explore PixLab iOS apps for video editing, photo editing, OCR scanning, and live translation in one Mobile Hub. Each app is powered by the same APIs available to developers.

AI Video Editor

Trim, merge, and apply AI effects to videos directly on iPhone. A full-featured mobile editor for creating polished content without desktop tools.

View iOS Product Page Open Web Version

AI Video Editor - iOS mobile app interface preview

AI Photo Editor

Full-featured photo editor for iOS - AI background removal, filters, layers, and creative effects on the go.

View iOS Product Page

AI Photo Editor - iOS app interface showing creative editing tools

DOCSCAN

Scan, digitize, and extract data from ID documents, receipts, and forms - powered by the DOCSCAN API.

View iOS Product Page

World Lens

Point and translate - live on-screen translation of text in the real world, powered by PixLab OCR and translation APIs.

View iOS Product Page

Four apps, one hub, one API foundation.

These apps are built on the exact same REST APIs available to every developer. Explore our mobile hub, or start using our APIs to build your own.

Open Mobile Apps Hub Build with PixLab APIs

Open Source SDKs and Native Libraries

Build Locally with
Native C and C++ Libraries

Run vision, inference, and data workflows in embedded, desktop, or edge environments when local execution and runtime control matter.

SOD

Embedded computer vision and machine learning library in C for real-time object detection and face recognition in CPU-first environments.

Learn more

SyNumPy

C++ NumPy file reader/writer - read and write .npy and .npz array files from C++ without Python, for ML pipelines.

Learn more

Tiny Dream

Stable Diffusion inference in pure C/C++ - generate images from text prompts on CPU without Python or PyTorch.

Learn more

ART

Real-Time ASCII Art C Library - transform images and video frames into ASCII art at runtime with configurable density.

Learn more

Explore PixLab's open source libraries and start building locally.

Open Downloads

Platform Overview

Build Vision Apps Faster
with APIs, SDKs, and Tools

From OCR and identity verification to media transformation, PDF generation, and mobile apps, PixLab provides the building blocks for end-to-end vision products.

Vision Platform and OCR

Chat with documents, extract structured data from any layout, and build RAG pipelines using the Vision Platform and VLM API.

FACEIO and Identity Stack

Passwordless facial login with FACEIO and full KYC onboarding via DOCSCAN - 11,000+ document types, 197+ countries.

Rich PDF Generation

Convert HTML, Markdown, or JSON into production-ready PDFs. Power invoices, reports, and certificates from one REST endpoint - no design tools needed.

App UI/UX Code Generator

Generate production-ready mobile UI code for SwiftUI, Flutter, React Native, and Jetpack Compose from screenshots, sketches, or natural language descriptions.

Media Processing & Creative Suite

150+ REST endpoints for background removal, watermark cleanup, smart crop, content moderation, OCR, and object detection - plus browser-native Video and Photo Editors, Creative Toolbox, and Bulk Image Background Removal.

On-Premises & Enterprise

Run PixLab APIs behind your own firewall with full on-premises deployment. Air-gapped environments, custom SLAs, and dedicated infrastructure for regulated industries.

Ready to Build?

Launch Vision Features
Without Rebuilding Your Stack

Use PixLab to build, test, and deploy document AI, identity, OCR, and media workflows with fewer moving parts and faster delivery cycles.

Start Building with PixLab View Pricing Plans

Essential Vision Tools & APIs for Businesses & Developers

FACEIO Facial Authentication

DOCSCAN ID & Document Extraction

Vision Workspace + VLM APIs

Creative and Media Tooling

DOCSCAN API for Global Identity and Document Verification

MRZ + Non-MRZ Coverage

Normalized JSON Output

Face Crop + Quality Signals

Text & Watermark Remove API

Background Removal API

Image Text Translation API

AI Editing Tools for Photo, Video, and Conversion

Use AI Vision to Extract Data from Documents

150+ Vision and Media APIs

Scan & Extract Data from IDs in Real Time

Background Removal in a Multi-Step Pipeline

Mobile Hub for OCR, Scanning & Editing

ID Verification & KYC

Document Q&A

Content Moderation

Detect & Blur Faces

Background Removal

Media & Document Conversion

FACEIO: Passwordless Facial Authentication

Intelligent Workflows

Rich PDF Generation for Operational Documents

Upcoming: Think-Act Desktop Agent

Vision Workspace and API Docs in One Loop

On-Premises and Enterprise Deployment

Additional Browser-Based Tools

AI Video Editor

AI Photo Editor

DOCSCAN

World Lens

Four apps, one hub, one API foundation.

SOD

SyNumPy

Tiny Dream

ART

Vision Platform and OCR

FACEIO and Identity Stack

Rich PDF Generation

App UI/UX Code Generator

Media Processing & Creative Suite

On-Premises & Enterprise

Launch Vision Features Without Rebuilding Your Stack

Essential Vision Tools & APIs
for Businesses & Developers

DOCSCAN API for
Global Identity and Document Verification

Launch Vision Features
Without Rebuilding Your Stack