A bot that converts e-books to various formats, powered by calibre, Pandoc, and Antiword! It currently supports 57 input formats and 31 output formats.
You can start using it or adding it to your group here on Telegram.
This project is a modular bot, made using Python 3 and the following:
This bot aims to give Telegram users an easy way to convert e-books between different formats.
It supports converting from the following formats:
['azw', 'azw3', 'azw4', 'azw8', 'adoc', 'asciidoc', 'bok', 'cb7', 'cbc', 'cbr',
'cbz', 'chm', 'csv', 'djv', 'djvu', 'doc', 'docm', 'docx', 'epub', 'fb2', 'fbz',
'htm', 'html', 'htmlz', 'ipynb', 'kepub', 'kfx', 'kfx-zip', 'kpf', 'lit', 'lrf',
'md', 'mediawiki', 'mobi', 'odt', 'opf', 'org', 'pdb', 'pdf', 'pml', 'pptx',
'prc', 'rb', 'rst', 'rtf', 'snb', 't2t', 'tcr', 'tex', 'textile', 'tsv', 'txt',
'txtz', 'typ', 'typst', 'xhtml', 'xlsx']
To the following formats:
['azw3', 'adoc', 'docx', 'epub', 'fb2', 'html', 'htmlz', 'kepub', 'kfx', 'lit',
'lrf', 'md', 'mobi', 'odt', 'oeb', 'org', 'pdb', 'pdf', 'pmlz', 'pptx', 'rb',
'rst', 'rtf', 'snb', 'tcr', 'tex', 'txt', 'txtz', 'typ', 'typst', 'zip']
Some more features of the bot:
- Force book direction to be RTL
- Multilingual support; you can contribute by adding your own language if you want :).
- Flatten book's table of contents.
- Convert Shamela old .bok files by first generating an EPUB in Python then using the existing calibre pipeline for other outputs.
- Convert legacy Word .doc files by extracting text with Antiword, then using the existing Calibre or Pandoc pipeline.
- Interactive conversion options before selecting output format.
- Per-user conversion option defaults are remembered automatically.
- Optional Pandoc backend for supported document routes, with Calibre as the default when both can convert the same route.
- Markdown-family, HTML, reStructuredText, AsciiDoc, Org, LaTeX, Typst, and plain-text document routes through Pandoc.
- Global output options: image compression, smart punctuation, text justification, line height, and paragraph spacing cleanup.
- DOCX options: page size and generated TOC toggle for Calibre; RTL output, H1/H2 heading page breaks, heading level shifts, and Arabic reference styling for Pandoc.
- EPUB output options: version selection, inline TOC, and background removal.
- PDF options: paper size, page numbers, cover-page generation, chapter page breaks, and Arabic font selection.
- Extra PDF fonts can be mounted locally without committing privately licensed font files.
- KFX options: PDOC/EBOK type and pages mode.
- EPUB input preprocessing options: fix EPUB metadata/spine issues, flatten TOC, and standardize footnotes.
- EPUB-to-EPUB volume splitting with per-volume output processing (up to 35 split files).
- Admin stats track users, recent attempts, success/failure rates, active users, and top conversion pairs.
- Admin broadcasts retry flood waits per recipient, remove permanently unreachable chats, and support active_within and username_only filters.
- Forward any supported file to the bot and choose the format to convert to; within a few seconds the bot replies with the converted file.
- The bot works in groups too. Reply with /convert to any file, then follow the same steps as in private.
- You can change the bot's preferences, such as language, using the /settings or /preferences commands.
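The backend rule from the feature list (Calibre is the default when both Calibre and Pandoc can convert the same route) can be sketched as follows. This is only an illustration: the route tables, the `pick_backend` name, and the `prefer_pandoc` flag are hypothetical and are not taken from the bot's code.

```python
from typing import Optional

# Hypothetical route tables, for illustration only (not the bot's real data).
CALIBRE_ROUTES = {("epub", "pdf"), ("epub", "docx"), ("docx", "epub")}
PANDOC_ROUTES = {("md", "docx"), ("docx", "epub"), ("rst", "html")}


def pick_backend(src: str, dst: str, prefer_pandoc: bool = False) -> Optional[str]:
    """Return "calibre", "pandoc", or None for an unsupported route."""
    route = (src.lower(), dst.lower())
    in_calibre = route in CALIBRE_ROUTES
    in_pandoc = route in PANDOC_ROUTES
    if in_calibre and in_pandoc:
        # Both backends can convert: Calibre wins unless Pandoc was requested.
        return "pandoc" if prefer_pandoc else "calibre"
    if in_calibre:
        return "calibre"
    if in_pandoc:
        return "pandoc"
    return None
```

With these example tables, `pick_backend("docx", "epub")` picks Calibre, while passing `prefer_pandoc=True` switches the same route to Pandoc.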
Admin-only commands:
- /stats shows user and conversion activity statistics.
- /broadcast sends the replied message to all stored chats. You can add optional filters under the command:
  /broadcast active_within 30 username_only yes
- /update updates the bot from a GitHub source archive without requiring git in the runtime container.
- /restart restarts the bot.
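As a rough illustration of how the /broadcast filters could be read, the sketch below parses name/value pairs that follow the command. The function name, the accepted truthy values, and the reading of active_within as a number of days are assumptions, not the bot's actual implementation.

```python
def parse_broadcast_filters(text: str) -> dict:
    """Parse "active_within <n>" and "username_only <yes|no>" pairs."""
    tokens = text.split()  # splits on spaces and newlines alike
    # Drop the leading command if it is present.
    if tokens and tokens[0].startswith("/broadcast"):
        tokens = tokens[1:]
    if len(tokens) % 2:
        raise ValueError("filters must be given as name/value pairs")
    filters = {}
    for name, value in zip(tokens[::2], tokens[1::2]):
        if name == "active_within":
            filters[name] = int(value)  # assumed to be a number of days
        elif name == "username_only":
            filters[name] = value.lower() in {"yes", "true", "1"}
        else:
            raise ValueError(f"unknown filter: {name}")
    return filters
```

Because the text is split on any whitespace, the filters can sit on the same line as the command or on the lines below it.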
- Copy the config.json.example file to config.json and fill in the required information:
{
  "tg_bot_token": "11111111:xxxxxxxxxxxxxxxxxx",
  "tg_bot_id": 111111111,
  "api_key": 1121221,
  "api_hash": "xxxxxxxxxxxxxxxxxxxx",
  "tg_bot_admins": [
    2222222
  ]
}
- Compile the translation files using the following command:
mise run i18n_compile
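Before starting the bot, it can help to check that config.json has every field from the example. The following is a minimal sketch using only the standard library; the `load_config` helper and the expected types (inferred from the example values above) are assumptions rather than part of the bot.

```python
import json
from pathlib import Path

# Keys from config.json.example; types are inferred from the example values.
REQUIRED_KEYS = {
    "tg_bot_token": str,
    "tg_bot_id": int,
    "api_key": int,
    "api_hash": str,
    "tg_bot_admins": list,
}


def load_config(path="config.json") -> dict:
    """Load the config file and fail early on missing or mistyped keys."""
    config = json.loads(Path(path).read_text(encoding="utf-8"))
    problems = []
    for key, expected in REQUIRED_KEYS.items():
        if key not in config:
            problems.append(f"missing key: {key}")
        elif not isinstance(config[key], expected):
            problems.append(f"{key} should be of type {expected.__name__}")
    if problems:
        raise ValueError("; ".join(problems))
    return config
```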
Before anything else, clone this repository.
- Simply run the following command:
docker compose up --build -d
The Docker setup uses uv for Python dependencies, installs runtime dependencies into /opt/venv, and runs the bot from
the repository mounted at /app.
For KFX conversion on recent Docker versions, the compose file uses Docker's targeted seccomp workaround for Wine:
security_opt:
  - seccomp=/etc/docker/seccomp-profile-v0.2.1.json
Download that profile on the host before starting the container, or remove this setting if you do not need KFX/Wine conversion.
Extra PDF fonts can be added without modifying the image. Put each font in its own directory under extra-pdf-fonts
and restart the bot. Local runs read ./extra-pdf-fonts by default:
extra-pdf-fonts/
  my_font/
    regular.ttf
    bold.ttf
The directory name becomes the option value, the display label is inferred from it, and the internal font family is read
from regular.ttf. Add an optional profile.json only when you need overrides:
{
  "label": "My Licensed Font",
  "regular": "MyFont-Regular.ttf",
  "bold": "MyFont-Bold.ttf",
  "fallback": "serif"
}
The compose file mounts this directory at /data/pdf-fonts and sets PDF_EXTRA_FONTS_DIR=/data/pdf-fonts. Set
PDF_EXTRA_FONTS_DIR yourself only when the fonts live somewhere else.
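The directory layout and profile.json overrides described above can be illustrated with a short sketch. It is not the bot's implementation: the `discover_fonts` name and the label inference rule (underscores to spaces, title case) are assumptions, and reading the internal font family from regular.ttf is left out because it needs a font parsing library.

```python
import json
import os
from pathlib import Path


def discover_fonts(fonts_dir=None) -> dict:
    """Map each font directory name (the option value) to its settings."""
    # Local default is ./extra-pdf-fonts unless PDF_EXTRA_FONTS_DIR is set.
    root = Path(fonts_dir or os.environ.get("PDF_EXTRA_FONTS_DIR", "extra-pdf-fonts"))
    fonts = {}
    if not root.is_dir():
        return fonts
    for font_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        settings = {
            # Assumed inference rule for the display label.
            "label": font_dir.name.replace("_", " ").title(),
            "regular": "regular.ttf",
            "bold": "bold.ttf",
        }
        profile = font_dir / "profile.json"
        if profile.is_file():
            # Optional overrides replace the defaults key by key.
            settings.update(json.loads(profile.read_text(encoding="utf-8")))
        # Skip directories that do not contain the regular font file.
        if (font_dir / settings["regular"]).is_file():
            fonts[font_dir.name] = settings
    return fonts
```

Keeping the overrides optional means a directory with just regular.ttf and bold.ttf works without any extra configuration.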
Running the bot without Docker requires Python 3.14 and uv.
Clone the repository and run:
uv sync --frozen
The bot uses SQLite through SQLAlchemy. Alembic runs database migrations automatically on startup.
You can read the Dockerfile to see how the bot's requirements are installed.
- Conversion relies on Calibre and its ebook-convert tool, so make sure Calibre is installed.
- Pandoc is required for Pandoc-backed document routes such as Markdown, HTML, reStructuredText, AsciiDoc, Org, LaTeX, Typst, and some DOCX/EPUB/TXT conversions.
- Antiword is required for legacy Word .doc input files. The bot extracts .doc text first, then converts the extracted text to the requested output format.
- To convert from and to KFX, you need to install the KFX Input and KFX Output plugins in calibre. This can be done from the command line using the following commands:
# KFX Input
wget https://plugins.calibre-ebook.com/291290.zip
calibre-customize -a 291290.zip
# KFX Output
wget https://plugins.calibre-ebook.com/272407.zip
calibre-customize -a 272407.zip
- Also, the KFX Output plugin requires Kindle Previewer 3, which can run on Linux under Wine.
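For a manual setup, a quick way to confirm that the external tools listed above are reachable is to look them up on PATH. This is a small sketch with a hypothetical helper name; it only checks that the executables exist, not that the KFX plugins or Kindle Previewer are installed.

```python
import shutil

# Executables named in the requirements above.
REQUIRED_TOOLS = ("ebook-convert", "calibre-customize", "pandoc", "antiword")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which) -> list:
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    print("missing:", ", ".join(missing) if missing else "none")
```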
If you managed to get all the pieces in place without using Docker, run the bot using:
uv run -m ebook_converter_bot
The bot uses gettext for internationalization and mise tasks for running i18n commands easily.
- First, generate the .pot template using mise run i18n_generate_messages.
- Update the current translation files using mise run i18n_merge, then edit the translation strings.
- Compile the translation files using mise run i18n_compile.
To add a new language to the bot, run the following command (change 'ar' to your language code), then edit the new language file with your translations and compile it.
mise run i18n_init_lang -- ar
Shamela old .bok files are first converted to an intermediate EPUB in pure Python (using access-parser), then (if the requested output isn't EPUB) calibre is used to convert that EPUB to the requested format.
For local debugging, there's also a small CLI wrapper:
uv run scripts/bok_to_epub.py path/to/book.bok
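The two-step .bok flow described above can be sketched as a small planner that lists the commands to run. It is an illustration under assumptions: the `plan_bok_conversion` name is hypothetical, and the intermediate EPUB is assumed to be written next to the input with an .epub suffix, which the CLI wrapper may not do.

```python
from pathlib import Path


def plan_bok_conversion(bok_path: str, output_format: str) -> list:
    """Return the ordered commands for converting a Shamela .bok file."""
    source = Path(bok_path)
    # Assumed location of the intermediate EPUB (same name, .epub suffix).
    epub = source.with_suffix(".epub")
    # Step 1: .bok to EPUB in pure Python, via the CLI wrapper shown above.
    commands = [["uv", "run", "scripts/bok_to_epub.py", str(source)]]
    fmt = output_format.lower().lstrip(".")
    if fmt != "epub":
        # Step 2: only needed for non-EPUB targets; calibre converts the EPUB.
        target = source.with_suffix(f".{fmt}")
        commands.append(["ebook-convert", str(epub), str(target)])
    return commands
```

Each returned command is a list of arguments, so it could be passed to `subprocess.run` one after another once the paths are confirmed.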