audia

Turn documents and ideas into audio, intelligently.

audia is an agentic Python package that converts PDFs (academic papers, reports, regulations) into podcast-style audio files. An LLM first rewrites the content into natural spoken language (math in plain English, tables as sentences, citations removed); the rewritten script is then passed to a text-to-speech (TTS) engine.
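The two-stage flow described above (curate the text, then synthesize it) can be sketched roughly as follows. This is not audia's actual API: `strip_citations`, `convert`, and the fake synthesizer are hypothetical names, and a regex stands in for the LLM curation pass purely for illustration.

```python
import re
from typing import Callable


def strip_citations(text: str) -> str:
    """Crude stand-in for the LLM pass: drop bracketed numeric citations such as [12] or [3, 4]."""
    without = re.sub(r"\s*\[\d+(?:\s*[,-]\s*\d+)*\]", "", text)
    # Collapse any doubled spaces left behind by the removal.
    return re.sub(r"\s{2,}", " ", without).strip()


def convert(
    raw_text: str,
    curate: Callable[[str], str],
    synthesize: Callable[[str], bytes],
) -> bytes:
    """Curate first, then hand the spoken-style script to a TTS callable."""
    script = curate(raw_text)
    return synthesize(script)


if __name__ == "__main__":
    # Hypothetical TTS stub: a real backend would return encoded audio.
    fake_tts = lambda script: script.encode("utf-8")
    audio = convert("Transformers scale well [12], as shown in [3, 4].", strip_citations, fake_tts)
    print(audio)
```

Passing the curation and synthesis steps as callables keeps the sketch independent of any particular LLM or TTS backend.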


At a glance

| Feature | Detail |
| --- | --- |
| LLM curation | Mandatory pass rewrites math, condenses tables, removes citations |
| TTS backends | edge-tts (free default), kokoro (local), OpenAI TTS |
| LLM backends | OpenAI, Anthropic |
| ArXiv research | Search → select → convert in one command |
| Voice input | Record a spoken query → STT → LLM distillation → ArXiv search |
| Web UI | FastAPI backend + React/Tailwind SPA |
| CLI | convert, research, listen, serve, info |
| Storage | SQLite via SQLAlchemy; papers + audio files |
| Python | 3.10 – 3.13 |
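Both the ArXiv research and voice input features listed above end in an arXiv search. As a rough illustration of that last step, the sketch below builds a query URL for the public arXiv API endpoint. How audia actually performs the search is not stated here; `build_arxiv_query_url` and its parameters are hypothetical.

```python
from urllib.parse import urlencode

# Public arXiv API endpoint; the helper around it is an illustrative assumption.
ARXIV_API = "http://export.arxiv.org/api/query"


def build_arxiv_query_url(keywords: list[str], max_results: int = 5) -> str:
    """Turn distilled keywords into an arXiv API query URL (all-fields search)."""
    params = {
        "search_query": "all:" + " ".join(keywords),
        "start": 0,
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_arxiv_query_url(["diffusion", "models"]))
```

The response from this endpoint is an Atom feed, so a real implementation would still need to fetch and parse it before presenting results for selection.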

Quick install

pip install audia
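Because the supported interpreter range is stated as Python 3.10 – 3.13, it can be worth checking the running interpreter before installing. The snippet below is a minimal sketch of such a check; `is_supported` is a hypothetical helper, not something shipped with audia.

```python
import sys

SUPPORTED_MIN = (3, 10)
SUPPORTED_MAX = (3, 13)


def is_supported(version=None) -> bool:
    """Return True if the (major, minor) version falls within the stated range."""
    if version is None:
        version = sys.version_info
    major_minor = tuple(version[:2])
    return SUPPORTED_MIN <= major_minor <= SUPPORTED_MAX


if __name__ == "__main__":
    current = ".".join(map(str, sys.version_info[:2]))
    status = "supported" if is_supported() else "outside the stated range"
    print(f"Python {current} is {status}")
```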