hydropix/TranslateBookWithLLM

TBL is a Python application designed for large-scale text translation, such as entire books (.EPUB), subtitle files (.SRT), and plain text, leveraging local LLMs via the Ollama API or cloud models via the Google Gemini API. The tool offers both a web interface for ease of use and a command-line interface for advanced users.

Features

  • 📚 Multiple Format Support: Translate plain text (.txt), book (.EPUB), and subtitle (.SRT) files while preserving formatting
  • 🌐 Web Interface: User-friendly browser-based interface
  • 💻 CLI Support: Command-line interface for automation and scripting
  • 🤖 Multiple LLM Providers: Support for both local Ollama models and the Google Gemini API
  • 🐳 Docker Support: Easy deployment with a Docker container

Windows Installation Guide

This comprehensive guide walks you through setting up the complete environment on Windows.

1. Prerequisites: Software Installation

  1. Miniconda (Python Environment Manager)

    • Purpose: Creates isolated Python environments to manage dependencies
    • Download: Get the latest Windows 64-bit installer from the Miniconda install page
    • Installation: Run installer, choose "Install for me only", use default settings
  2. Ollama (Local LLM Runner)

    • Purpose: Runs large language models locally
    • Download: Get the Windows installer from Ollama website
    • Installation: Run installer and follow instructions
  3. Git (Version Control)

    • Purpose: Clones the application repository
    • Download: Get the Windows installer from the Git website
    • Installation: Run installer and keep the default settings

2. Setting up the Python Environment

  1. Open Anaconda Prompt (search in Start Menu)

  2. Create and Activate Environment:

    # Create environment
    conda create -n translate_book_env python=3.9
    
    # Activate environment (do this every time)
    conda activate translate_book_env

3. Getting the Translation Application

# Navigate to your projects folder
cd C:\Projects
mkdir TranslateBookWithLLM
cd TranslateBookWithLLM

# Clone the repository
git clone https://github.com/hydropix/TranslateBookWithLLM.git .

4. Installing Dependencies

# Ensure environment is active
conda activate translate_book_env

# Install dependencies
pip install -r requirements.txt

5. Preparing Ollama

  1. Download an LLM Model:

    # Download the default model (recommended for French translation)
    ollama pull mistral-small:24b
    
    # Or try other models
    ollama pull qwen2:7b
    ollama pull llama3:8b
    
    # List available models
    ollama list
  2. Start Ollama Service:

    • Ollama usually runs automatically after installation
    • Look for Ollama icon in system tray
    • If not running, launch from Start Menu

6. Using the Application

Option A: Web Interface (Recommended)

  1. Start the Server:

    conda activate translate_book_env
    cd C:\Projects\TranslateBookWithLLM
    python translation_api.py
  2. Open Browser: Navigate to http://localhost:5000

    • The port can be configured via the PORT environment variable
    • Example (PowerShell): $env:PORT=8080; python translation_api.py
  3. Configure and Translate:

    • Select source and target languages
    • Choose your LLM model
    • Upload your .txt, .epub, or .srt file
    • Adjust advanced settings if needed
    • Start translation and monitor real-time progress
    • Download the translated result

Option B: Command Line Interface

Basic usage:

python translate.py -i input.txt -o output.txt

Command Arguments

  • -i, --input: (Required) Path to the input file (.txt, .epub, or .srt).
  • -o, --output: Output file path. If not specified, a default name will be generated (format: input_translated.ext).
  • -sl, --source_lang: Source language (default: "English").
  • -tl, --target_lang: Target language (default: "French").
  • -m, --model: LLM model to use (default: "mistral-small:24b").
  • -cs, --chunksize: Target lines per chunk for text files (default: 25).
  • --api_endpoint: Ollama API endpoint (default: "http://localhost:11434/api/generate").
  • --provider: LLM provider to use ("ollama" or "gemini", default: "ollama").
  • --gemini_api_key: Google Gemini API key (required when using gemini provider).

Examples:

# Basic English to French translation (text file)
python translate.py -i book.txt -o book_fr.txt

# Translate EPUB file
python translate.py -i book.epub -o book_fr.epub

# Translate SRT subtitle file
python translate.py -i movie.srt -o movie_fr.srt

# English to German with different model
python translate.py -i story.txt -o story_de.txt -sl English -tl German -m qwen2:7b

# Custom chunk size for better context with a text file
python translate.py -i novel.txt -o novel_fr.txt -cs 40

# Using Google Gemini instead of Ollama
python translate.py -i book.txt -o book_fr.txt --provider gemini --gemini_api_key YOUR_API_KEY -m gemini-2.0-flash

EPUB File Support

The application fully supports EPUB files:

  • Preserves Structure: Maintains most of the original EPUB structure and formatting
  • Selective Translation: Only translates content blocks (paragraphs, headings, etc.), as sketched below
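
A minimal sketch of that selection idea, using only the standard library (illustrative only, not the application's actual epub_processor.py):

import zipfile
import xml.etree.ElementTree as ET

XHTML_NS = "{http://www.w3.org/1999/xhtml}"
BLOCK_TAGS = {XHTML_NS + t for t in ("p", "h1", "h2", "h3", "li", "blockquote")}

def iter_content_blocks(epub_path):
    # An EPUB is a ZIP archive; only the XHTML documents hold translatable text.
    with zipfile.ZipFile(epub_path) as epub:
        for name in epub.namelist():
            if not name.endswith((".xhtml", ".html")):
                continue
            root = ET.fromstring(epub.read(name))
            for elem in root.iter():
                if elem.tag in BLOCK_TAGS:
                    text = "".join(elem.itertext()).strip()
                    if text:
                        yield name, text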

SRT Subtitle File Support

The application fully supports SRT subtitle files:

  • Preserves Timing: Maintains all original timestamp information
  • Format Preservation: Keeps subtitle numbering and structure intact
  • Smart Translation: Translates only the subtitle text, preserving technical elements (see the sketch below)
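
A minimal sketch of that split, assuming a generic translate() callable (a hypothetical helper, not the app's srt_processor.py):

def translate_srt(srt_text, translate):
    out_blocks = []
    for block in srt_text.strip().split("\n\n"):
        lines = block.splitlines()
        # Keep the index line and the "start --> end" timestamp line verbatim.
        if len(lines) >= 3 and "-->" in lines[1]:
            head, body = lines[:2], lines[2:]
            out_blocks.append("\n".join(head + [translate("\n".join(body))]))
        else:
            out_blocks.append(block)  # leave malformed blocks untouched
    return "\n\n".join(out_blocks) + "\n"

print(translate_srt("1\n00:00:01,000 --> 00:00:03,000\nHello!\n", str.upper))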

Google Gemini Support

In addition to local Ollama models, the application supports the Google Gemini API:

Setup:

  1. Get your API key from Google AI Studio
  2. Use the --provider gemini flag with your API key

Available Gemini Models:

  • gemini-2.0-flash (default, fast and efficient)
  • gemini-1.5-pro (more capable, slower)
  • gemini-1.5-flash (balanced performance)

Web Interface:

  • Select "Google Gemini" from the LLM Provider dropdown
  • Enter your API key in the secure field
  • Choose your preferred Gemini model

CLI Example:

python translate.py -i book.txt -o book_translated.txt \
    --provider gemini \
    --gemini_api_key YOUR_API_KEY \
    -m gemini-2.0-flash \
    -sl English -tl Spanish

Note: Gemini API requires an internet connection and has usage quotas. Check Google's pricing for details.


Docker Support

Quick Start with Docker

# Build the Docker image
docker build -t translatebook .

# Run the container
docker run -p 5000:5000 -v $(pwd)/translated_files:/app/translated_files translatebook

# Or with custom port
docker run -p 8080:5000 -e PORT=5000 -v $(pwd)/translated_files:/app/translated_files translatebook

Docker Compose (Optional)

Create a docker-compose.yml file:

version: '3'
services:
  translatebook:
    build: .
    ports:
      - "5000:5000"
    volumes:
      - ./translated_files:/app/translated_files
    environment:
      - PORT=5000

Then run: docker-compose up


Advanced Configuration

Web Interface Settings

The web interface provides easy access to:

  • Chunk Size: Lines per translation chunk (10-100)
  • Timeout: Request timeout in seconds (30-600)
  • Context Window: Model context size (1024-32768)
  • Max Attempts: Retry attempts for failed chunks (1-5)
  • Custom Instructions (optional): Add specific translation guidelines or requirements
  • Enable Post-processing: Improve translation quality with additional refinement

Configuration Files

Configuration is centralized in src/config.py with support for environment variables:

Environment Variables (.env file)

Create a .env file in the project root to override default settings:

# Copy the example file
cp .env.example .env

# Edit with your settings
API_ENDPOINT=http://localhost:11434/api/generate
DEFAULT_MODEL=mistral-small:24b
MAIN_LINES_PER_CHUNK=25
# ... see .env.example for all available settings
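
In that spirit, a hypothetical sketch of how src/config.py can map these variables to settings (assuming python-dotenv, which the .env workflow implies; the real file may differ):

import os
from dotenv import load_dotenv  # python-dotenv; assumed from the .env workflow

load_dotenv()  # reads .env from the project root, if present

API_ENDPOINT = os.getenv("API_ENDPOINT", "http://localhost:11434/api/generate")
DEFAULT_MODEL = os.getenv("DEFAULT_MODEL", "mistral-small:24b")
MAIN_LINES_PER_CHUNK = int(os.getenv("MAIN_LINES_PER_CHUNK", "25"))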

prompts.py - Translation Prompts

Translation quality depends heavily on the prompt. Prompts are managed in prompts.py:

# The prompt template uses the actual tags from config.py
structured_prompt = f"""
## [ROLE] 
# You are a {target_language} professional translator.

## [TRANSLATION INSTRUCTIONS] 
+ Translate in the author's style.
+ Precisely preserve the deeper meaning of the text.
+ Adapt expressions and culture to the {target_language} language.
+ Vary your vocabulary with synonyms, avoid repetition.
+ Maintain the original layout, remove typos and line-break hyphens.

## [FORMATTING INSTRUCTIONS] 
+ Translate ONLY the main content between the specified tags.
+ Surround your translation with {TRANSLATE_TAG_IN} and {TRANSLATE_TAG_OUT} tags.
+ Return only the translation, nothing else.
"""

Note: The translation tags are defined in src/config.py and automatically used by the prompt generator.
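
For example, a hypothetical extractor could pull the translated text back out of the model response using those tags (the tag values below are placeholders, not the ones in src/config.py):

import re

TRANSLATE_TAG_IN, TRANSLATE_TAG_OUT = "[TRANSLATION]", "[/TRANSLATION]"  # placeholders

def extract_translation(response):
    pattern = re.escape(TRANSLATE_TAG_IN) + r"(.*?)" + re.escape(TRANSLATE_TAG_OUT)
    match = re.search(pattern, response, re.DOTALL)
    return match.group(1).strip() if match else None

print(extract_translation("[TRANSLATION] Bonjour le monde. [/TRANSLATION]"))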

Custom Instructions Feature

You can enhance translation quality by providing custom instructions through the web interface or API:

Web Interface:

  • Add custom instructions in the "Custom Instructions" text field
  • Examples:
    • "Maintain formal tone throughout the translation"
    • "Keep technical terms in English"
    • "Use Quebec French dialect"

The custom instructions are automatically integrated into the translation prompt.
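
A minimal sketch of that integration, assuming a hypothetical build_prompt() helper (the app's actual wiring may differ):

def build_prompt(base_prompt: str, custom_instructions: str = "") -> str:
    # Append user-supplied guidelines as an extra section of the structured prompt.
    if custom_instructions.strip():
        return f"{base_prompt}\n## [CUSTOM INSTRUCTIONS]\n+ {custom_instructions.strip()}"
    return base_prompt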

Post-processing Feature

Enable post-processing to improve translation quality through an additional refinement pass:

How it works:

  1. Initial translation is performed as usual
  2. A second pass reviews and refines the translation (see the sketch after this list)
  3. The post-processor checks for:
    • Grammar and fluency
    • Consistency in terminology
    • Natural language flow
    • Cultural appropriateness
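
A minimal sketch of the two-pass flow, assuming a generic llm(prompt) callable (hypothetical; the app routes this through its provider layer):

def translate_with_post_processing(chunk, llm, target_lang, extra_instructions=""):
    draft = llm(f"Translate the following into {target_lang}:\n{chunk}")
    review_prompt = (
        f"Review this {target_lang} translation for grammar, consistent "
        f"terminology, and natural flow. {extra_instructions}\n"
        f"Original:\n{chunk}\n\nTranslation:\n{draft}\n\n"
        "Return only the improved translation."
    )
    return llm(review_prompt)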

Web Interface:

  • Toggle "Enable Post-processing" in advanced settings
  • Optionally add specific post-processing instructions

Post-processing Instructions Examples:

  • "Ensure consistent use of formal pronouns"
  • "Check for gender agreement in French"
  • "Verify technical terminology accuracy"
  • "Improve readability for children"

Note: Post-processing increases translation time but generally improves quality, especially for literary or professional texts.

Troubleshooting

Common Issues

Web Interface Won't Start:

# Check whether the configured port is already in use (default: 5000)
netstat -an | find "5000"

# If it is, set the PORT environment variable to start the server on a different port

Ollama Connection Issues:

  • Ensure Ollama is running (check the system tray).
  • Verify that no firewall is blocking localhost:11434.
  • Test with: curl http://localhost:11434/api/tags

Translation Timeouts:

  • Increase REQUEST_TIMEOUT in src/config.py (default: 60 seconds)
  • Use smaller chunk sizes
  • Try a faster model
  • For web interface, adjust timeout in advanced settings

Poor Translation Quality:

  • Experiment with different models.
  • Adjust chunk size for better context.
  • Modify the translation prompt.
  • Clean input text beforehand.

Model Not Found:

# List installed models
ollama list

# Install missing model
ollama pull your-model-name

Getting Help

  1. Check the browser console for web interface issues
  2. Monitor the terminal output for detailed error messages
  3. Test with small text samples first
  4. Verify all dependencies are installed correctly
  5. For EPUB issues, check XML parsing errors in the console
  6. Review src/config.py for adjustable timeout and retry settings

Architecture

The application follows a clean modular architecture:

Project Structure

src/
├── core/                    # Core translation logic
│   ├── text_processor.py    # Text chunking and context management
│   ├── translator.py        # Translation orchestration and job tracking
│   ├── llm_client.py        # Async API calls to LLM providers
│   ├── llm_providers.py     # Provider abstraction (Ollama, Gemini)
│   ├── epub_processor.py    # EPUB-specific processing
│   └── srt_processor.py     # SRT subtitle processing
├── api/                     # Flask web server
│   ├── routes.py           # REST API endpoints
│   ├── websocket.py        # WebSocket handlers for real-time updates
│   └── handlers.py         # Translation job management
├── web/                     # Web interface
│   ├── static/             # CSS, JavaScript, images
│   └── templates/          # HTML templates
└── utils/                   # Utilities
    ├── file_utils.py       # File processing utilities
    ├── security.py         # Security features for file handling
    ├── file_detector.py    # Centralized file type detection
    └── unified_logger.py   # Unified logging system

Root Level Files

  • translate.py: CLI interface (lightweight wrapper around core modules)
  • translation_api.py: Web server entry point
  • prompts.py: Translation prompt generation and management
  • .env.example: Example environment variables file

Configuration Files

  • src/config.py: Centralized configuration with environment variable support

Translation Pipeline

  1. Text Processing: Intelligent chunking that preserves sentence boundaries (see the sketch after this list)
  2. Context Management: Maintains translation context between chunks
  3. LLM Communication: Async requests with retry logic and timeout handling
  4. Format-Specific Processing:
    • EPUB: XML namespace-aware processing preserving structure
    • SRT: Subtitle timing and format preservation
  5. Error Recovery: Graceful degradation with original text preservation
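
A minimal sketch of boundary-preserving chunking (hypothetical; the real text_processor.py is more involved):

def chunk_lines(lines, target_lines=25):
    # Group lines into chunks of roughly target_lines, extending a chunk past
    # the target until a line ends a sentence, so no chunk splits mid-sentence.
    chunks, current = [], []
    for line in lines:
        current.append(line)
        ends_sentence = line.rstrip().endswith((".", "!", "?"))
        if len(current) >= target_lines and ends_sentence:
            chunks.append("\n".join(current))
            current = []
    if current:
        chunks.append("\n".join(current))
    return chunks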

The web interface communicates via REST API and WebSocket for real-time progress, while the CLI version provides direct access for automation.

Key Features Implementation

LLM Provider Architecture

  • Abstraction Layer: LLMProvider base class for easy provider addition (see the sketch after this list)
  • Multiple Providers: Built-in support for Ollama (local) and Gemini (cloud)
  • Factory Pattern: Dynamic provider instantiation based on configuration
  • Unified Interface: Consistent API across different LLM providers
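
A hypothetical sketch of that shape (class and method names here are illustrative, not the repo's actual API):

from abc import ABC, abstractmethod

class LLMProvider(ABC):
    """Common interface every provider implements."""
    @abstractmethod
    async def generate(self, prompt: str) -> str: ...

class OllamaProvider(LLMProvider):
    async def generate(self, prompt: str) -> str:
        raise NotImplementedError  # would POST to the local Ollama endpoint

class GeminiProvider(LLMProvider):
    async def generate(self, prompt: str) -> str:
        raise NotImplementedError  # would call the Google Gemini API

def make_provider(name: str) -> LLMProvider:
    # Factory: pick the provider class from configuration at runtime.
    return {"ollama": OllamaProvider, "gemini": GeminiProvider}[name]()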

Asynchronous Processing

  • Uses httpx for concurrent API requests
  • Implements retry logic with exponential backoff (sketched below)
  • Configurable timeout handling for long translations
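
A minimal sketch of that pattern with httpx (illustrative, not the app's llm_client.py):

import asyncio
import httpx

async def post_with_retries(url, payload, attempts=3, timeout=60.0):
    delay = 1.0
    async with httpx.AsyncClient(timeout=timeout) as client:
        for attempt in range(1, attempts + 1):
            try:
                response = await client.post(url, json=payload)
                response.raise_for_status()
                return response.json()
            except (httpx.TransportError, httpx.HTTPStatusError):
                if attempt == attempts:
                    raise
                await asyncio.sleep(delay)
                delay *= 2  # exponential backoff between retries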

Job Management System

  • Unique translation IDs for tracking multiple jobs
  • In-memory job storage with status updates (see the sketch below)
  • WebSocket events for real-time progress streaming
  • Support for translation interruption
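
In outline, the bookkeeping could look like this (hypothetical; the real handlers also emit WebSocket events):

import uuid

jobs = {}  # translation_id -> job state

def create_job(filename):
    translation_id = str(uuid.uuid4())  # unique ID for tracking this job
    jobs[translation_id] = {"file": filename, "status": "queued", "progress": 0}
    return translation_id

def update_progress(translation_id, percent):
    jobs[translation_id].update(status="running", progress=percent)
    # a WebSocket event would be emitted here for the web interface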

Security Features

  • File type validation for uploads
  • Size limits for uploaded files
  • Secure temporary file handling
  • Sanitized file paths and names (see the sketch below)
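
A hypothetical sketch of those upload checks (the real src/utils/security.py may use different names and limits):

from pathlib import Path

ALLOWED_EXTENSIONS = {".txt", ".epub", ".srt"}
MAX_SIZE_BYTES = 50 * 1024 * 1024  # example limit, not the app's actual value

def validate_upload(filename, size_bytes):
    safe_name = Path(filename).name  # drop any directory components
    if Path(safe_name).suffix.lower() not in ALLOWED_EXTENSIONS:
        raise ValueError(f"Unsupported file type: {safe_name}")
    if size_bytes > MAX_SIZE_BYTES:
        raise ValueError("File exceeds the upload size limit")
    return safe_name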

Context-Aware Translation

  • Preserves sentence boundaries across chunks
  • Maintains translation context for consistency
  • Handles line-break hyphens (see the sketch below)
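
For the hyphen case, a one-regex sketch of the idea (hypothetical helper):

import re

def join_linebreak_hyphens(text):
    # "transla-\ntion" -> "translation": rejoin words split across lines
    return re.sub(r"(\w)-\n(\w)", r"\1\2", text)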
