trans2

Paused

App Files Files Community

Mayo commited on Mar 27

Commit

0e8f75a

unverified ·

1 Parent(s): e8436c0

docs: add site

Browse files

Files changed (27) hide show

.github/workflows/docs.yml +29 -0
.gitignore +3 -1
README.md +1 -3
docs/README.ja.md +0 -205
docs/README.ru.md +0 -205
docs/README.zh-CN.md +0 -205
docs/assets/Koharu_Halo.png +3 -0
docs/assets/Koharu_Icon.png +3 -0
docs/assets/koharu-screenshot-en.png +3 -0
docs/assets/koharu-screenshot-ja.png +3 -0
docs/assets/koharu-screenshot-zh-CN.png +3 -0
docs/explanation/acceleration-and-runtime.md +47 -0
docs/explanation/how-koharu-works.md +37 -0
docs/explanation/index.md +13 -0
docs/explanation/models-and-providers.md +71 -0
docs/how-to/build-from-source.md +26 -0
docs/how-to/export-and-manage-projects.md +29 -0
docs/how-to/index.md +14 -0
docs/how-to/install-koharu.md +45 -0
docs/how-to/run-gui-headless-and-mcp.md +71 -0
docs/index.md +52 -0
docs/reference/cli.md +47 -0
docs/reference/index.md +12 -0
docs/reference/keyboard-shortcuts.md +13 -0
docs/tutorials/index.md +11 -0
docs/tutorials/translate-your-first-page.md +72 -0
zensical.toml +111 -0

.github/workflows/docs.yml ADDED Viewed

	@@ -0,0 +1,29 @@

+name: Documentation
+on:
+  push:
+    branches:
+      - master
+      - main
+permissions:
+  contents: read
+  pages: write
+  id-token: write
+jobs:
+  deploy:
+    environment:
+      name: github-pages
+      url: ${{ steps.deployment.outputs.page_url }}
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/configure-pages@v5
+      - uses: actions/checkout@v5
+      - uses: actions/setup-python@v5
+        with:
+          python-version: 3.x
+      - run: pip install zensical
+      - run: zensical build --clean
+      - uses: actions/upload-pages-artifact@v4
+        with:
+          path: site
+      - uses: actions/deploy-pages@v4
+        id: deployment

.gitignore CHANGED Viewed

@@ -54,4 +54,6 @@ test-results/
 AGENTS.md
 claudedocs/
 /.claude
-/.serena

 AGENTS.md
 claudedocs/
 /.claude
+# Zensical
+site/

README.md CHANGED Viewed

@@ -1,7 +1,5 @@
 # Koharu
-[日本語](./docs/README.ja.md) | [简体中文](./docs/README.zh-CN.md) | [Русский](./docs/README.ru.md)
 ML-powered manga translator, written in **Rust**.
 Koharu introduces a new workflow for manga translation, utilizing the power of ML to automate the process. It combines the capabilities of object detection, OCR, inpainting, and LLMs to create a seamless translation experience.
@@ -13,7 +11,7 @@ Under the hood, Koharu uses [candle](https://github.com/huggingface/candle) for
 ---
-![screenshot](assets/koharu-screenshot-en.png)
 > [!NOTE]
 > For help and support, please join our [Discord server](https://discord.gg/mHvHkxGnUY).

 # Koharu
 ML-powered manga translator, written in **Rust**.
 Koharu introduces a new workflow for manga translation, utilizing the power of ML to automate the process. It combines the capabilities of object detection, OCR, inpainting, and LLMs to create a seamless translation experience.
 ---
+![screenshot](docs/assets/koharu-screenshot-en.png)
 > [!NOTE]
 > For help and support, please join our [Discord server](https://discord.gg/mHvHkxGnUY).

docs/README.ja.md DELETED Viewed

@@ -1,205 +0,0 @@
-# Koharu
-**Rust**で書かれた、ML（機械学習）搭載のマンガ翻訳ツールです。
-Koharu は、ML の力を活用して翻訳工程を自動化する、新しいマンガ翻訳ワークフローを提供します。物体検出、OCR、インペインティング、LLM を組み合わせることで、シームレスな翻訳体験を実現します。
-内部では、高性能推論のために [candle](https://github.com/huggingface/candle) を使用し、GUI には [Tauri](https://github.com/tauri-apps/tauri) を採用しています。すべてのコンポーネントが Rust で書かれており、安全性と高速性を両立しています。
-> [!NOTE]
-> Koharu は既定で、ビジョンモデルとローカル LLM を **お使いの端末上** で実行します。リモート LLM プロバイダーを選択した場合、翻訳対象のテキストのみが設定したプロバイダーへ送信されます。Koharu 自体がユーザーデータを収集することはありません。
----
-![スクリーンショット](../assets/koharu-screenshot-ja.png)
-> [!NOTE]
-> ヘルプやサポートについては、[Discord サーバー](https://discord.gg/mHvHkxGnUY)に参加してください。
-## 特徴
-- セリフ（吹き出し）の自動検出とセグメンテーション
-- マンガ文字の認識のための OCR
-- 画像から元の文字を消すためのインペインティング
-- LLM による翻訳
-- CJK（中国語・日本語・韓国語）向けの縦書きレイアウト
-- 編集可能なテキスト付きのレイヤー PSD 書き出し
-- AI エージェントとの連携のための MCP サーバー
-## 使い方
-### ホットキー
-- <kbd>Ctrl</kbd> + マウスホイール: 拡大／縮小
-- <kbd>Ctrl</kbd> + ドラッグ: キャンバスのパン（移動）
-- <kbd>Del</kbd>: 選択したテキストブロックを削除
-### 書き出し
-Koharu は現在のページをレンダリング済み画像として書き出すだけでなく、レイヤー付きの Photoshop PSD としても書き出せます。PSD 書き出しでは補助レイヤーを保持しつつ、翻訳済みテキストを編集可能なテキストレイヤーとして保存できます。
-### MCP サーバー
-Koharu には MCP サーバーが内蔵されており、AI エージェントとの連携に使用できます。デフォルトでは、MCP サーバーはランダムなポートでリッスンしますが、`--port` フラグを使用してポートを指定できます。
-```bash
-# macOS / Linux
-koharu --port 9999
-# Windows
-koharu.exe --port 9999
-```
-AI エージェントの MCP サーバー URL フィールドに `http://localhost:9999/mcp` と入力してください。
-### ヘッドレスモード
-Koharu はコマンドラインからヘッドレスモードで実行できます。
-```bash
-# macOS / Linux
-koharu --port 4000 --headless
-# Windows
-koharu.exe --port 4000 --headless
-```
-これで、`http://localhost:4000` から Koharu Web UI にアクセスできます。
-### ファイルの関連付け
-Windows では、Koharu が自動的に `.khr` ファイルを関連付けるため、ダブルクリックで開けます。`.khr` ファイルは、内部に含まれる画像のサムネイルを表示するために、画像として開くこともできます。
-## GPU アクセラレーション
-CUDA と Metal による GPU アクセラレーションに対応しており、対応ハードウェアでは性能が大きく向上します。
-### CUDA
-Koharu は CUDA 対応ビルドが用意されており、NVIDIA GPU を活用してより高速に処理できます。
-Koharu には CUDA toolkit 13.1 と cuDNN 9.19 が同梱されており、dylib は初回起動時にアプリケーションデータディレクトリへ自動的に展開されます。
-> [!NOTE]
-> 最新の NVIDIA ドライバーがインストールされていることを確認してください。最新のドライバーは [NVIDIA App](https://www.nvidia.com/en-us/software/nvidia-app/) からダウンロードできます。
-#### 対応する NVIDIA GPU
-Koharu は、Compute Capability 7.5 以上の NVIDIA GPU に対応しています。
-お使いの GPU が対応しているかは、[CUDA GPU Compute Capability](https://developer.nvidia.com/cuda-gpus) と [cuDNN Support Matrix](https://docs.nvidia.com/deeplearning/cudnn/backend/latest/reference/support-matrix.html) を確認してください。
-### Metal
-Koharu は Apple Silicon（M1、M2 など）を搭載した macOS で Metal による GPU アクセラレーションに対応しています。これにより、幅広い Apple デバイスで効率的に動作します。
-### CPU フォールバック
-推論に CPU を使うよう強制することもできます。
-```bash
-# macOS / Linux
-koharu --cpu
-# Windows
-koharu.exe --cpu
-```
-## ML モデル
-Koharu は、コンピュータビジョンと自然言語処��のモデルを組み合わせて各処理を実行します。
-### コンピュータビジョンモデル
-Koharu は用途ごとに複数の学習済みモデルを使用します。
-- [PP-DocLayoutV3](https://huggingface.co/PaddlePaddle/PP-DocLayoutV3_safetensors) テキスト検出とレイアウト分析のため
-- [comic-text-detector](https://huggingface.co/mayocream/comic-text-detector) テキストセグメンテーションのため
-- [PaddleOCR-VL-1.5](https://huggingface.co/PaddlePaddle/PaddleOCR-VL-1.5) OCR テキスト認識のため
-- [lama-manga](https://huggingface.co/mayocream/lama-manga) インペインティングのため
-- [YuzuMarker.FontDetection](https://huggingface.co/fffonion/yuzumarker-font-detection)　フォントと色の検出のため
-モデルは Koharu を初めて実行した際に自動的にダウンロードされます。
-Koharu では、性能と Rust との互換性を高めるため、元のモデルを safetensors 形式へ変換しています。変換済みモデルは [Hugging Face](https://huggingface.co/mayocream) 上でホストしています。
-### 大規模言語モデル（LLM）
-Koharu はローカル LLM とリモート LLM の両方に対応しており、可能な場合はシステムのロケール設定に基づいてモデルを事前選択します。
-#### ローカル LLM
-Koharu は [candle](https://github.com/huggingface/candle) を通じて、GGUF 形式の量子化 LLM を利用できます。これらのモデルは端末上で動作し、設定で選択したタイミングで必要に応じて自動ダウンロードされます。対応モデルと推奨用途は以下の通りです。
-英語への翻訳:
-- [vntl-llama3-8b-v2](https://huggingface.co/lmg-anon/vntl-llama3-8b-v2-gguf): Q8_0 の重みサイズが約 8.5 GB。精度を最優先したい場合に最適で、VRAM 10 GB 以上、または CPU 推論なら十分なシステム RAM を推奨します。
-- [lfm2-350m-enjp-mt](https://huggingface.co/LiquidAI/LFM2-350M-ENJP-MT-GGUF): 超軽量（約 350M、Q8_0）。CPU や低メモリ GPU でも快適に動作し、クイックプレビューや低スペック環境に最適ですが、品質は低下します。
-中国語への翻訳:
-- [sakura-galtransl-7b-v3.7](https://huggingface.co/SakuraLLM/Sakura-GalTransl-7B-v3.7): 約 6.3 GB。VRAM 8 GB に収まり、品質と速度のバランスが良好です。
-- [sakura-1.5b-qwen2.5-v1.0](https://huggingface.co/shing3232/Sakura-1.5B-Qwen2.5-v1.0-GGUF-IMX): 軽量（約 1.5B、Q5KS）。ミドルレンジ GPU（VRAM 4〜6 GB）や CPU のみの環境でも、適度な RAM があれば動作します。7B/8B より高速で、Qwen 系トークナイザの挙動も維持します。
-その他の言語:
-- [hunyuan-7b-mt-v1.0](https://huggingface.co/Mungert/Hunyuan-MT-7B-GGUF): 約 6.3GB。VRAM 8 GB に収まり、マルチ言語の翻訳品質も良好です。
-LLM は、設定でモデルを選択したタイミングで必要に応じて自動ダウンロードされます。メモリが限られている場合は、品質要件を満たす範囲で最小のモデルを選んでください。十分な VRAM/RAM がある場合は、より良い翻訳のために 7B/8B 系を推奨します。
-#### リモート LLM
-Koharu は、ローカルモデルをダウンロードしなくても、リモートまたはセルフホストの API プロバイダー経由で翻訳できます。対応するリモートプロバイダーは以下の通りです。
-- OpenAI
-- Gemini
-- Claude
-- DeepSeek
-- OpenAI Compatible: LM Studio、OpenRouter、または OpenAI 形式の `/v1/models` と `/v1/chat/completions` API を提供する任意のエンドポイント
-リモートプロバイダーは **Settings > API Keys** で設定します。OpenAI Compatible ではカスタムの Base URL も指定します。LM Studio のようなローカルサーバーでは API キーが不要な場合がありますが、OpenRouter のようなホスト型サービスでは通常 API キーが必要です。
-ローカルモデルのダウンロードを避けたい場合、端末側の VRAM/RAM 使用量を抑えたい場合、またはホスト型モデルへ接続したい場合は、リモートプロバイダーを利用してください。翻訳対象として選択した OCR テキストは、設定したプロバイダーへ送信されます。
-## インストール
-最新のリリースは [releases ページ](https://github.com/mayocream/koharu/releases/latest) からダウンロードできます。
-Windows、macOS、Linux 向けにビルド済みバイナリを提供しています。その他のプラットフォームではソースからビルドが必要な場合があります。詳細は下記の [開発](#開発) セクションを参照してください。
-## 開発
-Koharu をソースからビルドするには、以下の手順に従ってください。
-### 前提条件
-- [Rust](https://www.rust-lang.org/tools/install)（1.92 以上）
-- [Bun](https://bun.sh/)（1.0 以上）
-### 依存関係のインストール
-```bash
-bun install
-```
-### ビルド
-```bash
-bun run build
-```
-ビルドされたバイナリは `target/release` ディレクトリに生成されます。
-## スポンサー
-Koharu が役に立った場合は、開発支援のためにスポンサーをご検討ください。
-- [GitHub Sponsors](https://github.com/sponsors/mayocream)
-- [Patreon](https://www.patreon.com/mayocream)
-## 貢献者
-<a href="https://github.com/mayocream/koharu/graphs/contributors">
-  <img src="https://contrib.rocks/image?repo=mayocream/koharu" />
-</a>
-## ライセンス
-Koharu は [GNU General Public License v3.0](../LICENSE) の下でライセンスされています。

docs/README.ru.md DELETED Viewed

@@ -1,205 +0,0 @@
-# Koharu
-Переводчик манги на основе ML, написанный на **Rust**.
-Koharu предлагает новый рабочий процесс перевода манги, используя возможности машинного обучения для автоматизации. Он объединяет детекцию объектов, OCR, инпейнтинг и LLM для создания бесшовного процесса перевода.
-Под капотом Koharu использует [candle](https://github.com/huggingface/candle) для высокопроизводительного инференса и [Tauri](https://github.com/tauri-apps/tauri) для графического интерфейса. Все компоненты написаны на Rust, что обеспечивает безопасность и скорость.
-> [!NOTE]
-> По умолчанию Koharu запускает модели компьютерного зрения и локальные LLM **на вашем устройстве**. Если вы выберете удалённого LLM-провайдера, Koharu отправляет провайдеру только текст для перевода. Koharu не собирает пользовательские данные.
----
-![скриншот](../assets/koharu-screenshot-en.png)
-> [!NOTE]
-> Для помощи и поддержки присоединяйтесь к нашему [Discord-серверу](https://discord.gg/mHvHkxGnUY).
-## Возможности
-- Автоматическое обнаружение и сегментация речевых пузырей
-- OCR для распознавания текста в манге
-- Инпейнтинг для удаления исходного текста с изображений
-- Перевод с помощью LLM
-- Вертикальная вёрстка текста для CJK-языков
-- Экспорт в многослойный PSD с редактируемым текстом
-- MCP-сервер для интеграции с ИИ-агентами
-## Использование
-### Горячие клавиши
-- <kbd>Ctrl</kbd> + колесо мыши: масштабирование
-- <kbd>Ctrl</kbd> + перетаскивание: панорамирование холста
-- <kbd>Del</kbd>: удалить выбранный текстовый блок
-### Экспорт
-Koharu может экспортировать текущую страницу как отрендеренное изображение или как многослойный PSD для Photoshop. При экспорте в PSD сохраняются вспомогательные слои, а переведённый текст записывается как редактируемые текстовые слои для дальнейшей доработки в Photoshop.
-### MCP-сервер
-Koharu имеет встроенный MCP-сервер для интеграции с ИИ-агентами. По умолчанию MCP-сервер слушает на случайном порту, но порт можно указать с помощью флага `--port`.
-```bash
-# macOS / Linux
-koharu --port 9999
-# Windows
-koharu.exe --port 9999
-```
-Введите `http://localhost:9999/mcp` в поле URL MCP-сервера вашего ИИ-агента.
-### Безголовый режим
-Koharu можно запустить в безголовом режиме через командную строку.
-```bash
-# macOS / Linux
-koharu --port 4000 --headless
-# Windows
-koharu.exe --port 4000 --headless
-```
-Теперь вы можете открыть веб-интерфейс Koharu по адресу `http://localhost:4000`.
-### Ассоциация файлов
-В Windows Koharu автоматически ассоциируется с файлами `.khr`, так что их можно открывать двойным щелчком. Файлы `.khr` также можно открывать как изображения для просмотра миниатюр содержащихся в них изображений.
-## Ускорение на GPU
-Поддерживается ускорение на GPU через CUDA и Metal, что значительно повышает производительность на совместимом оборудовании.
-### CUDA
-Koharu собран с поддержкой CUDA, что позволяет использовать мощность GPU NVIDIA для ускорения обработки.
-Koharu включает CUDA toolkit 13.1 и cuDNN 9.19, динамические библиотеки автоматически извлекаются в каталог данных приложения при первом запуске.
-> [!NOTE]
-> Убедитесь, что у вас установлены последние драйверы NVIDIA. Скачать последние драйверы можно через [NVIDIA App](https://www.nvidia.com/en-us/software/nvidia-app/).
-#### Поддерживаемые GPU NVIDIA
-Koharu поддерживает GPU NVIDIA с Compute Capability 7.5 и выше.
-Проверьте совместимость вашего GPU: [CUDA GPU Compute Capability](https://developer.nvidia.com/cuda-gpus) и [cuDNN Support Matrix](https://docs.nvidia.com/deeplearning/cudnn/backend/latest/reference/support-matrix.html).
-### Metal
-Koharu поддерживает Metal для ускорения на GPU в macOS с Apple Silicon (M1, M2 и т.д.). Это позволяет эффективно работать на широком спектре устройств Apple.
-### Откат на CPU
-Вы всегда можете принудительно использовать CPU для инференса:
-```bash
-# macOS / Linux
-koharu --cpu
-# Windows
-koharu.exe --cpu
-```
-## ML-модели
-Koharu использует комбинацию моделей компьютерного зрения и обработки естественного языка.
-### Модели компьютерного зрения
-Koharu использует несколько предобученных моделей для различных задач:
-- [PP-DocLayoutV3](https://huggingface.co/PaddlePaddle/PP-DocLayoutV3_safetensors) для детекции текста и анализа макета
-- [comic-text-detector](https://huggingface.co/mayocream/comic-text-detector) для сегментации текста
-- [PaddleOCR-VL-1.5](https://huggingface.co/PaddlePaddle/PaddleOCR-VL-1.5) для распознавания текста (OCR)
-- [lama-manga](https://huggingface.co/mayocream/lama-manga) для инпейнтинга
-- [YuzuMarker.FontDetection](https://huggingface.co/fffonion/yuzumarker-font-detection) для определения шрифта и цвета
-Модели автоматически загружаются при первом запуске Koharu.
-Мы конвертируем оригинальные модели в формат safetensors для лучшей производительности и совместимости с Rust. Конвертированные модели размещены на [Hugging Face](https://huggingface.co/mayocream).
-### Большие языковые модели (LLM)
-Koharu поддерживает как локальные, так и удалённые LLM-бэкенды и по возможности предварительно выбирает модель на основе системной локали.
-#### Локальные LLM
-Koharu поддерживает различные квантизированные LLM в формате GGUF через [candle](https://github.com/huggingface/candle). Эти модели работают на вашем устройстве и загружаются по запросу при выборе в настройках. Поддерживаемые модели и рекомендации:
-Для перевода на английский:
-- [vntl-llama3-8b-v2](https://huggingface.co/lmg-anon/vntl-llama3-8b-v2-gguf): ~8.5 ГБ (Q8_0). Рекомендуется VRAM ≥10 ГБ или достаточно оперативной памяти для CPU-инференса. Лучший выбор, когда важна точность.
-- [lfm2-350m-enjp-mt](https://huggingface.co/LiquidAI/LFM2-350M-ENJP-MT-GGUF): сверхлёгкая (~350M, Q8_0). Комфортно работает на CPU и GPU с малым объёмом памяти. Идеальна для быстрого предпросмотра или слабых машин, но качество ниже.
-Для перевода на китайский:
-- [sakura-galtransl-7b-v3.7](https://huggingface.co/SakuraLLM/Sakura-GalTransl-7B-v3.7): ~6.3 ГБ, помещается в 8 ГБ VRAM. Хороший баланс качества и скорости.
-- [sakura-1.5b-qwen2.5-v1.0](https://huggingface.co/shing3232/Sakura-1.5B-Qwen2.5-v1.0-GGUF-IMX): лёгкая (~1.5B, Q5KS). Подходит для GPU среднего уровня (4–6 ГБ VRAM) или CPU с достаточным объёмом RAM. Быстрее 7B/8B моделей.
-Для других языков:
-- [hunyuan-7b-mt-v1.0](https://huggingface.co/Mungert/Hunyuan-MT-7B-GGUF): ~6.3 ГБ, помещается в 8 ГБ VRAM. Достойное качество мультиязычного перевода.
-LLM автоматически загружаются при выборе модели в настройках. Если память ограничена, выбирайте наименьшую модель, удовлетворяющую вашим требованиям к качеству. При достаточном объёме VRAM/RAM предпочтительны моде��и 7B/8B для лучшего перевода.
-#### Удалённые LLM
-Koharu также может переводить через удалённые или самостоятельно размещённые API-провайдеры вместо загруженной локальной модели. Поддерживаемые удалённые провайдеры:
-- OpenAI
-- Gemini
-- Claude
-- DeepSeek
-- OpenAI Compatible: LM Studio, OpenRouter или любой эндпоинт, предоставляющий API в стиле OpenAI (`/v1/models` и `/v1/chat/completions`)
-Удалённые провайдеры настраиваются в **Настройки > API-ключи**. Для OpenAI Compatible также указывается пользовательский Base URL. API-ключи необязательны для локальных серверов вроде LM Studio, но обычно требуются для размещённых сервисов вроде OpenRouter.
-Используйте удалённых провайдеров, если хотите избежать загрузки локальных моделей, снизить использование VRAM/RAM или подключить Koharu к размещённой модели. Учтите, что текст OCR, выбранный для перевода, отправляется настроенному провайдеру.
-## Установка
-Последнюю версию Koharu можно скачать со [страницы релизов](https://github.com/mayocream/koharu/releases/latest).
-Мы предоставляем готовые сборки для Windows, macOS и Linux. Для других платформ может потребоваться сборка из исходников — см. раздел [Разработка](#разработка) ниже.
-## Разработка
-Чтобы собрать Koharu из исходников, выполните следующие шаги.
-### Необходимые компоненты
-- [Rust](https://www.rust-lang.org/tools/install) (1.92 или новее)
-- [Bun](https://bun.sh/) (1.0 или новее)
-### Установка зависимостей
-```bash
-bun install
-```
-### Сборка
-```bash
-bun run build
-```
-Собранные бинарные файлы будут в каталоге `target/release`.
-## Спонсорство
-Если Koharu оказался полезен, рассмотрите возможность спонсировать проект для поддержки его развития!
-- [GitHub Sponsors](https://github.com/sponsors/mayocream)
-- [Patreon](https://www.patreon.com/mayocream)
-## Участники
-<a href="https://github.com/mayocream/koharu/graphs/contributors">
-  <img src="https://contrib.rocks/image?repo=mayocream/koharu" />
-</a>
-## Лицензия
-Koharu лицензирован под [GNU General Public License v3.0](../LICENSE).

docs/README.zh-CN.md DELETED Viewed

@@ -1,205 +0,0 @@
-# Koharu
-基于机器学习（ML）的漫画翻译工具，使用 **Rust** 编写。
-Koharu 引入了一种新的漫画翻译工作流，利用机器学习能力自动化翻译流程。它将目标检测、OCR、图像修复（inpainting）和 LLM 结合起来，提供流畅的一体化翻译体验。
-在底层实现中，Koharu 使用 [candle](https://github.com/huggingface/candle) 进行高性能推理，使用 [Tauri](https://github.com/tauri-apps/tauri) 构建 GUI。所有组件均使用 Rust 编写，兼顾安全性与性能。
-> [!NOTE]
-> Koharu 默认会在你的本地设备上运行视觉模型和本地 LLM。如果你选择远程 LLM 提供商，只有待翻译的文本会发送到你配置的提供商。Koharu 本身不会收集任何用户数据。
----
-![screenshot](../assets/koharu-screenshot-zh-CN.png)
-> [!NOTE]
-> 如需帮助与支持，请加入我们的 [Discord 服务器](https://discord.gg/mHvHkxGnUY)。
-## 功能特性
-- 自动检测并分割对话气泡
-- 使用 OCR 识别漫画文字
-- 通过图像修复去除原图文字
-- 基于 LLM 的翻译
-- 面向 CJK 语言的竖排文本布局
-- 支持导出带可编辑文字图层的 PSD
-- 面向 AI Agent 的 MCP 服务器
-## 使用方法
-### 快捷键
-- <kbd>Ctrl</kbd> + 鼠标滚轮：缩放
-- <kbd>Ctrl</kbd> + 拖动：平移画布
-- <kbd>Del</kbd>：删除选中的文本块
-### 导出
-Koharu 既可以将当前页面导出为渲染后的图片，也可以导出为带图层的 Photoshop PSD。PSD 导出会保留辅助图层，并将翻译后的文字写成可编辑的文字图层，方便在 Photoshop 中继续调整。
-### MCP 服务器
-Koharu 内置 MCP 服务器，可用于与 AI Agent 集成。默认情况下，MCP 服务器会监听一个随机端口；你也可以通过 `--port` 参数指定端口。
-```bash
-# macOS / Linux
-koharu --port 9999
-# Windows
-koharu.exe --port 9999
-```
-然后在你的 AI Agent 的 MCP Server URL 字段中填写 `http://localhost:9999/mcp`。
-### 无界面模式（Headless Mode）
-Koharu 支持通过命令行以无界面模式运行。
-```bash
-# macOS / Linux
-koharu --port 4000 --headless
-# Windows
-koharu.exe --port 4000 --headless
-```
-现在你可以通过 `http://localhost:4000` 访问 Koharu Web UI。
-### 文件关联
-在 Windows 上，Koharu 会自动关联 `.khr` 文件，因此可以直接双击打开。`.khr` 文件也可以作为图片打开，以查看其中图像的缩略图。
-## GPU 加速
-Koharu 支持 CUDA 和 Metal GPU 加速，可在受支持硬件上显著提升性能。
-### CUDA
-Koharu 提供 CUDA 支持，可利用 NVIDIA GPU 实现更快处理。
-Koharu 内置 CUDA toolkit 13.1 和 cuDNN 9.19，相关动态库会在首次运行时自动解压到应用数据目录。
-> [!NOTE]
-> 请确保系统已安装最新 NVIDIA 驱动。你可以通过 [NVIDIA App](https://www.nvidia.com/en-us/software/nvidia-app/) 下载最新版驱动。
-#### 支持的 NVIDIA GPU
-Koharu 支持计算能力（Compute Capability）7.5 及以上的 NVIDIA GPU。
-请通过 [CUDA GPU Compute Capability](https://developer.nvidia.com/cuda-gpus) 和 [cuDNN Support Matrix](https://docs.nvidia.com/deeplearning/cudnn/backend/latest/reference/support-matrix.html) 确认你的 GPU 是否受支持。
-### Metal
-Koharu 支持在搭载 Apple Silicon（M1、M2 等）的 macOS 上使用 Metal 进行 GPU 加速，可在多种 Apple 设备上高效运行。
-### CPU 回退
-你也可以强制 Koharu 使用 CPU 进行推理：
-```bash
-# macOS / Linux
-koharu --cpu
-# Windows
-koharu.exe --cpu
-```
-## ML 模型
-Koharu 结合计算机视觉与自然语言处理模型来完成各项任务。
-### 计算机视觉模型
-Koharu 在不同任务中使用多个预训练模型：
-- [PP-DocLayoutV3](https://huggingface.co/PaddlePaddle/PP-DocLayoutV3_safetensors) 用于文本检测和布局分析
-- [comic-text-detector](https://huggingface.co/mayocream/comic-text-detector) 用于生成文本遮罩
-- [PaddleOCR-VL-1.5](https://huggingface.co/PaddlePaddle/PaddleOCR-VL-1.5) 用于 OCR 文本识别
-- [lama-manga](https://huggingface.co/mayocream/lama-manga) 用于图像修复
-- [YuzuMarker.FontDetection](https://huggingface.co/fffonion/yuzumarker-font-detection) 用于字体和颜色检测
-这些模型会在你首次运行 Koharu 时自动下载。
-为了提升性能并增强 Rust 生态兼容性，我们将原始模型转换为 safetensors 格式。转换后的模型托管在 [Hugging Face](https://huggingface.co/mayocream)。
-### 大语言模型（LLM）
-Koharu 同时支持本地和远程 LLM 后端，并会在可能时根据系统语言环境预选模型。
-#### 本地 LLM
-Koharu 通过 [candle](https://github.com/huggingface/candle) 支持 GGUF 格式的量化 LLM。这些模型在本机运行，并会在你于设置中选中它们时按需自动下载。支持模型与推荐使用场景如下：
-翻译为英文：
-- [vntl-llama3-8b-v2](https://huggingface.co/lmg-anon/vntl-llama3-8b-v2-gguf)：约 8.5 GB（Q8_0）权重，建议 >=10 GB VRAM，或在 CPU 推理时配备充足系统内存。更适合对准确度要求高的场景。
-- [lfm2-350m-enjp-mt](https://huggingface.co/LiquidAI/LFM2-350M-ENJP-MT-GGUF)：超轻量（约 350M，Q8_0）；在 CPU 和低显存 GPU 上也能流畅运行，适合快速预览或低配设备，但质量会有所下降。
-翻译为中文：
-- [sakura-galtransl-7b-v3.7](https://huggingface.co/SakuraLLM/Sakura-GalTransl-7B-v3.7)：约 6.3 GB，可在 8 GB VRAM 上运行，质量与速度平衡良好。
-- [sakura-1.5b-qwen2.5-v1.0](https://huggingface.co/shing3232/Sakura-1.5B-Qwen2.5-v1.0-GGUF-IMX)：轻量（约 1.5B，Q5KS）；适合中端 GPU（4-6 GB VRAM）或纯 CPU 环境（需中等内存），速度快于 7B/8B，同时保留 Qwen 系 tokenizer 行为。
-翻译为其他语言：
-- [hunyuan-7b-mt-v1.0](https://huggingface.co/Mungert/Hunyuan-MT-7B-GGUF)：约 6.3 GB，可在 8 GB VRAM 上运行，具备较好的多语言翻译能力。
-当你在设置中选择模型时，LLM 会按需自动下载。如果内存受限，建议优先选择满足质量要求的最小模型；若 VRAM/RAM 充足，优先选择 7B/8B 模型以获得更佳翻译效果。
-#### 远程 LLM
-Koharu 也可以通过远程或自托管 API 提供商进行翻译，而无需下载本地模型。支持的远程提供商如下：
-- OpenAI
-- Gemini
-- Claude
-- DeepSeek
-- OpenAI Compatible，包括 LM Studio、OpenRouter，或任何提供 OpenAI 风格 `/v1/models` 和 `/v1/chat/completions` API 的服务
-远程提供商在 **Settings > API Keys** 中配置。对于 OpenAI Compatible，你还需要设置自定义 Base URL。像 LM Studio 这样的本地服务通常可以不填 API Key，而 OpenRouter 这类托管服务通常需要 API Key。
-如果你希望避免下载本地模型、减少本地 VRAM/RAM 占用，或者希望接入托管模型，可以选择远程提供商。需要注意的是，被选中用于翻译的 OCR 文本会发送到所配置的提供商。
-## 安装
-你可以在 [releases 页面](https://github.com/mayocream/koharu/releases/latest) 下载 Koharu 的最新版本。
-我们提供 Windows、macOS 和 Linux 的预构建二进制包。其他平台可能需要从源码构建，详见下方 [开发](#开发) 部分。
-## 开发
-按以下步骤从源码构建 Koharu。
-### 前置要求
-- [Rust](https://www.rust-lang.org/tools/install)（1.92 或更高）
-- [Bun](https://bun.sh/)（1.0 或更高）
-### 安装依赖
-```bash
-bun install
-```
-### 构建
-```bash
-bun run build
-```
-构建产物位于 `target/release` 目录。
-## 赞助
-如果 Koharu 对你有帮助，欢迎赞助项目以支持持续开发。
-- [GitHub Sponsors](https://github.com/sponsors/mayocream)
-- [Patreon](https://www.patreon.com/mayocream)
-## 贡献者
-<a href="https://github.com/mayocream/koharu/graphs/contributors">
-  <img src="https://contrib.rocks/image?repo=mayocream/koharu" />
-</a>
-## 许可证
-Koharu 使用 [GNU General Public License v3.0](../LICENSE) 授权。

docs/assets/Koharu_Halo.png ADDED Viewed

Git LFS Details

SHA256: 8e07c095ee8e80b8c9c25d39693c3afb2ea91b5e98f96e750f48074467579834
Pointer size: 131 Bytes
Size of remote file: 128 kB

docs/assets/Koharu_Icon.png ADDED Viewed

Git LFS Details

SHA256: 35868875403430b9ad12821d9d452be342c079b21019c7e9ecb17454e7bc92eb
Pointer size: 130 Bytes
Size of remote file: 79.7 kB

docs/assets/koharu-screenshot-en.png ADDED Viewed

Git LFS Details

SHA256: c75068d9336d0974f821748b0a7254fecde205e6f8afb830263f3fab4a3dcbd8
Pointer size: 132 Bytes
Size of remote file: 2.22 MB

docs/assets/koharu-screenshot-ja.png ADDED Viewed

Git LFS Details

SHA256: 601326f20fe90797c7adaea729d0285c018832523354d5be37fc19b3ff385575
Pointer size: 132 Bytes
Size of remote file: 2.22 MB

docs/assets/koharu-screenshot-zh-CN.png ADDED Viewed

Git LFS Details

SHA256: 71eee83c8b223e2e1010d73f9a80178bfe11e95b92b28699675dce5c2ba7664d
Pointer size: 132 Bytes
Size of remote file: 2.24 MB

docs/explanation/acceleration-and-runtime.md ADDED Viewed

	@@ -0,0 +1,47 @@

+---
+title: Acceleration and Runtime
+---
+# Acceleration and Runtime
+Koharu supports multiple runtime paths so it can run well on a wide range of hardware.
+## CUDA on NVIDIA GPUs
+CUDA is the main GPU acceleration path on systems with supported NVIDIA hardware.
+- Koharu supports NVIDIA GPUs with compute capability 7.5 or higher
+- Koharu bundles CUDA toolkit 13.1
+- Koharu bundles cuDNN 9.19
+On first run, the required dynamic libraries are extracted to the application data directory.
+!!! note
+    CUDA acceleration depends on a recent NVIDIA driver. If the driver does not support CUDA 13.1, Koharu falls back to CPU.
+## Metal on Apple Silicon
+On macOS, Koharu supports Metal acceleration for Apple Silicon devices such as M1 and M2 systems.
+## CPU fallback
+Koharu can always run on CPU when GPU acceleration is unavailable or when you force CPU mode explicitly.
+```bash
+# macOS / Linux
+koharu --cpu
+# Windows
+koharu.exe --cpu
+```
+## Why fallback matters
+Fallback behavior makes Koharu usable on more machines, but it changes the experience:
+- GPU inference is much faster when supported
+- CPU mode is more compatible but can be substantially slower
+- Smaller local LLMs are often the best choice on CPU-only systems
+For exact model choices, see [Models and Providers](models-and-providers.md).

docs/explanation/how-koharu-works.md ADDED Viewed

	@@ -0,0 +1,37 @@

+---
+title: How Koharu Works
+---
+# How Koharu Works
+Koharu is built around a translation pipeline for manga pages.
+## The core workflow
+For a typical page, Koharu combines several stages:
+1. Text detection and layout analysis
+2. Text region segmentation
+3. OCR text recognition
+4. Inpainting to remove original text
+5. LLM-based translation
+6. Text rendering and export
+This lets one application handle both the language work and much of the visual cleanup.
+## Why the stack matters
+Koharu uses:
+- [candle](https://github.com/huggingface/candle) for high-performance inference
+- [Tauri](https://github.com/tauri-apps/tauri) for the desktop app shell
+- Rust across the stack for performance and memory safety
+## Local-first design
+By default, Koharu runs:
+- vision models locally
+- local LLMs locally
+If you configure a remote LLM provider, Koharu sends only the text selected for translation to that provider.

docs/explanation/index.md ADDED Viewed

	@@ -0,0 +1,13 @@

+---
+title: Explanation
+---
+# Explanation
+Explanation pages describe how Koharu is put together and why it behaves the way it does.
+## Topics
+- [How Koharu Works](how-koharu-works.md)
+- [Acceleration and Runtime](acceleration-and-runtime.md)
+- [Models and Providers](models-and-providers.md)

docs/explanation/models-and-providers.md ADDED Viewed

	@@ -0,0 +1,71 @@

+---
+title: Models and Providers
+---
+# Models and Providers
+Koharu uses both vision models and language models. The vision stack prepares the page; the language stack handles translation.
+## Vision models
+Koharu automatically downloads the required vision models when you use them for the first time.
+The default stack includes:
+- [PP-DocLayoutV3](https://huggingface.co/PaddlePaddle/PP-DocLayoutV3_safetensors) for text detection and layout analysis
+- [comic-text-detector](https://huggingface.co/mayocream/comic-text-detector) for text segmentation
+- [PaddleOCR-VL-1.5](https://huggingface.co/PaddlePaddle/PaddleOCR-VL-1.5) for OCR text recognition
+- [lama-manga](https://huggingface.co/mayocream/lama-manga) for inpainting
+- [YuzuMarker.FontDetection](https://huggingface.co/fffonion/yuzumarker-font-detection) for font and color detection
+Converted model weights are hosted on [Hugging Face](https://huggingface.co/mayocream) in safetensors format for Rust compatibility and performance.
+## Local LLMs
+Koharu supports local GGUF models through [candle](https://github.com/huggingface/candle). These models run on your machine and are downloaded on demand when you select them in Settings.
+### Suggested local models for English output
+- [vntl-llama3-8b-v2](https://huggingface.co/lmg-anon/vntl-llama3-8b-v2-gguf): around 8.5 GB in Q8_0 form, best when translation quality matters most
+- [lfm2-350m-enjp-mt](https://huggingface.co/LiquidAI/LFM2-350M-ENJP-MT-GGUF): very small and useful for low-memory systems or quick previews
+### Suggested local models for Chinese output
+- [sakura-galtransl-7b-v3.7](https://huggingface.co/SakuraLLM/Sakura-GalTransl-7B-v3.7): a balanced choice for quality and speed on 8 GB class GPUs
+- [sakura-1.5b-qwen2.5-v1.0](https://huggingface.co/shing3232/Sakura-1.5B-Qwen2.5-v1.0-GGUF-IMX): a lighter option for mid-range or CPU-heavy setups
+### Suggested local model for broader language coverage
+- [hunyuan-7b-mt-v1.0](https://huggingface.co/Mungert/Hunyuan-MT-7B-GGUF): a multi-language option with moderate hardware requirements
+## Remote providers
+Koharu can translate through remote or self-hosted APIs instead of downloading a local model.
+Supported providers include:
+- OpenAI
+- Gemini
+- Claude
+- DeepSeek
+- OpenAI-compatible APIs such as LM Studio, OpenRouter, or any endpoint that exposes `/v1/models` and `/v1/chat/completions`
+Remote providers are configured in **Settings > API Keys**.
+## Choosing between local and remote
+Use local models when you want:
+- the most private setup
+- offline operation after downloads complete
+- tighter control over hardware usage
+Use remote providers when you want:
+- to avoid large local model downloads
+- to reduce local VRAM or RAM usage
+- to connect to a hosted or self-managed model service
+!!! note
+    When you use a remote provider, Koharu sends OCR text selected for translation to the provider you configured.

docs/how-to/build-from-source.md ADDED Viewed

	@@ -0,0 +1,26 @@

+---
+title: Build From Source
+---
+# Build From Source
+If you do not want to use a release build, you can compile Koharu locally.
+## Prerequisites
+- [Rust](https://www.rust-lang.org/tools/install) 1.92 or later
+- [Bun](https://bun.sh/) 1.0 or later
+## Install dependencies
+```bash
+bun install
+```
+## Build the project
+```bash
+bun run build
+```
+The built binaries will be placed in `target/release`.

docs/how-to/export-and-manage-projects.md ADDED Viewed

	@@ -0,0 +1,29 @@

+---
+title: Export Pages and Manage Projects
+---
+# Export Pages and Manage Projects
+## Export rendered output
+Koharu can export the current page as a rendered image.
+Use this when you want a final flattened result for reading, sharing, or publishing.
+## Export layered PSD files
+Koharu can also export a layered Photoshop PSD.
+PSD export preserves helper layers and writes translated text as editable text layers, which makes final cleanup in Photoshop much easier.
+## Work with `.khr` project files
+Koharu stores project data in `.khr` files.
+On Windows, Koharu automatically associates `.khr` files so they can be opened by double-clicking. These files can also be viewed in ways that expose the thumbnails of their contained images.
+## When to use each format
+- Rendered image: best for final delivery
+- PSD: best for manual cleanup and touch-up work
+- `.khr`: best for saving in-progress Koharu projects

docs/how-to/index.md ADDED Viewed

	@@ -0,0 +1,14 @@

+---
+title: How-To Guides
+---
+# How-To Guides
+How-to guides focus on specific jobs you may want to complete with Koharu.
+## Common tasks
+- [Install Koharu](install-koharu.md)
+- [Run GUI, Headless, and MCP Modes](run-gui-headless-and-mcp.md)
+- [Export Pages and Manage Projects](export-and-manage-projects.md)
+- [Build From Source](build-from-source.md)

docs/how-to/install-koharu.md ADDED Viewed

	@@ -0,0 +1,45 @@

+---
+title: Install Koharu
+---
+# Install Koharu
+## Download a release build
+Download the latest release from the [Koharu releases page](https://github.com/mayocream/koharu/releases/latest).
+Koharu provides prebuilt binaries for:
+- Windows
+- macOS
+- Linux
+If your platform is not covered by a release build, use [Build From Source](build-from-source.md).
+## First launch expectations
+On first run, Koharu may:
+- extract bundled runtime libraries
+- download required vision models
+- download local LLMs later when you select them in Settings
+This is normal and can take time depending on your connection and hardware.
+## GPU acceleration notes
+Koharu supports:
+- CUDA on supported NVIDIA GPUs
+- Metal on Apple Silicon Macs
+- CPU fallback on all platforms
+For CUDA, Koharu bundles CUDA toolkit 13.1 and cuDNN 9.19, then extracts the required dynamic libraries into the app data directory on first run.
+!!! note
+    Keep your NVIDIA driver up to date. Koharu checks for CUDA 13.1 support and falls back to CPU if the driver is too old.
+## Need help?
+For support, join the [Discord server](https://discord.gg/mHvHkxGnUY).

docs/how-to/run-gui-headless-and-mcp.md ADDED Viewed

	@@ -0,0 +1,71 @@

+---
+title: Run GUI, Headless, and MCP Modes
+---
+# Run GUI, Headless, and MCP Modes
+Koharu can run as a normal desktop app, a headless local server with a Web UI, or an MCP server for AI agents.
+## Run the desktop app
+Launch Koharu normally from your installed application.
+This is the default mode and is the best choice for most users.
+## Run headless mode
+Headless mode starts the local HTTP server without opening the desktop GUI.
+```bash
+# macOS / Linux
+koharu --port 4000 --headless
+# Windows
+koharu.exe --port 4000 --headless
+```
+After startup, open the Web UI at `http://localhost:4000`.
+## Run with a fixed port
+By default, Koharu uses a random local port. Use `--port` when you need a stable address.
+```bash
+# macOS / Linux
+koharu --port 9999
+# Windows
+koharu.exe --port 9999
+```
+## Connect to the MCP server
+Koharu includes a built-in MCP server. When you run Koharu on a fixed port, point your AI agent at:
+`http://localhost:9999/mcp`
+Replace `9999` with the port you chose.
+## Force CPU mode
+Use `--cpu` when you want to disable GPU inference explicitly.
+```bash
+# macOS / Linux
+koharu --cpu
+# Windows
+koharu.exe --cpu
+```
+## Download runtime dependencies only
+Use `--download` if you want Koharu to fetch runtime packages and exit without starting the app.
+```bash
+# macOS / Linux
+koharu --download
+# Windows
+koharu.exe --download
+```

docs/index.md ADDED Viewed

	@@ -0,0 +1,52 @@

+---
+title: Overview
+---
+# Koharu
+ML-powered manga translator, written in **Rust**.
+Koharu introduces a practical workflow for manga translation. It combines object detection, OCR, inpainting, and LLM-assisted translation so you can move from raw page to cleaned export in one tool.
+Under the hood, Koharu uses [candle](https://github.com/huggingface/candle) for high-performance inference and [Tauri](https://github.com/tauri-apps/tauri) for the desktop app. All major components are written in Rust.
+!!! note
+    Koharu runs its vision models and local LLMs **locally** on your machine by default. If you choose a remote LLM provider, Koharu sends translation text only to the provider you configured. Koharu itself does not collect user data.
+---
+![screenshot](assets/koharu-screenshot-en.png)
+!!! note
+    For help and support, please join our [Discord server](https://discord.gg/mHvHkxGnUY).
+## Start here
+- New to Koharu: [Translate Your First Page](tutorials/translate-your-first-page.md)
+- Installing a release build: [Install Koharu](how-to/install-koharu.md)
+- Running the desktop app, Web UI, or MCP server: [Run GUI, Headless, and MCP Modes](how-to/run-gui-headless-and-mcp.md)
+- Exporting images, PSDs, and project files: [Export Pages and Manage Projects](how-to/export-and-manage-projects.md)
+- Building from source: [Build From Source](how-to/build-from-source.md)
+## What Koharu can do
+- Detect and segment manga text regions automatically
+- Run OCR on manga pages
+- Inpaint original text from the artwork
+- Translate with local or remote LLMs
+- Render vertical text for CJK languages
+- Export layered PSD files with editable text
+- Expose an MCP server for AI-agent workflows
+## Learn the system
+- Workflow overview: [How Koharu Works](explanation/how-koharu-works.md)
+- GPU and fallback behavior: [Acceleration and Runtime](explanation/acceleration-and-runtime.md)
+- Vision models and LLM backends: [Models and Providers](explanation/models-and-providers.md)
+## Look up details
+- Command-line options: [CLI Reference](reference/cli.md)
+- Default controls: [Keyboard Shortcuts](reference/keyboard-shortcuts.md)

docs/reference/cli.md ADDED Viewed

	@@ -0,0 +1,47 @@

+---
+title: CLI Reference
+---
+# CLI Reference
+This page covers the command-line options exposed by Koharu's desktop binary.
+## Common usage
+```bash
+# macOS / Linux
+koharu [OPTIONS]
+# Windows
+koharu.exe [OPTIONS]
+```
+## Options
+| Option | Meaning |
+| --- | --- |
+| `-d`, `--download` | Download runtime libraries and exit |
+| `--cpu` | Force CPU mode even when a GPU is available |
+| `-p`, `--port <PORT>` | Bind the local HTTP server to a specific port |
+| `--headless` | Run without starting the desktop GUI |
+| `--debug` | Enable debug mode with console output |
+## Common patterns
+Start headless Web UI on a stable port:
+```bash
+koharu --port 4000 --headless
+```
+Start with CPU-only inference:
+```bash
+koharu --cpu
+```
+Download runtime packages ahead of time:
+```bash
+koharu --download
+```

docs/reference/index.md ADDED Viewed

	@@ -0,0 +1,12 @@

+---
+title: Reference
+---
+# Reference
+Reference pages collect factual details you may want to look up quickly.
+## Available references
+- [CLI Reference](cli.md)
+- [Keyboard Shortcuts](keyboard-shortcuts.md)

docs/reference/keyboard-shortcuts.md ADDED Viewed

	@@ -0,0 +1,13 @@

+---
+title: Keyboard Shortcuts
+---
+# Keyboard Shortcuts
+These are the default controls documented for the editor.
+| Shortcut | Action |
+| --- | --- |
+| `Ctrl` + mouse wheel | Zoom in or out |
+| `Ctrl` + drag | Pan the canvas |
+| `Del` | Delete the selected text block |

docs/tutorials/index.md ADDED Viewed

	@@ -0,0 +1,11 @@

+---
+title: Tutorials
+---
+# Tutorials
+Tutorials walk through complete tasks from start to finish.
+## Available tutorials
+- [Translate Your First Page](translate-your-first-page.md)

docs/tutorials/translate-your-first-page.md ADDED Viewed

	@@ -0,0 +1,72 @@

+---
+title: Translate Your First Page
+---
+# Translate Your First Page
+This tutorial covers the normal Koharu workflow for a single manga page: import, detect, recognize, translate, review, and export.
+## Before you begin
+- Install Koharu from the latest GitHub release
+- Start with a clear manga page image
+- Make sure you have enough local VRAM/RAM for your preferred model, or plan to use a remote provider
+If you have not installed Koharu yet, start with [Install Koharu](../how-to/install-koharu.md).
+## 1. Launch Koharu
+Open the desktop application normally.
+On the first run, Koharu may download required runtime packages and ML models. This is expected.
+## 2. Import a page
+Load your manga page into the app.
+Koharu keeps your work inside a project, and on Windows it can associate `.khr` project files so you can reopen them by double-clicking.
+## 3. Detect text and run OCR
+Use Koharu's built-in vision pipeline to:
+- detect speech bubbles and text regions
+- segment text areas
+- recognize the original text with OCR
+At this point, review the detected blocks and clean up anything obvious before translation.
+## 4. Choose a translation backend
+Pick either:
+- a local GGUF model if you want everything to stay on your machine
+- a remote provider if you want to avoid local model downloads or heavy local inference
+Koharu can use OpenAI, Gemini, Claude, DeepSeek, and OpenAI-compatible endpoints such as LM Studio or OpenRouter.
+## 5. Translate and review
+Run translation on the page, then inspect the result carefully.
+Koharu helps with text layout and vertical CJK rendering, but you should still review:
+- names and terminology
+- line breaks
+- font choices
+- bubble fit
+## 6. Export the result
+When the page looks right, export it as either:
+- a rendered image
+- a layered Photoshop PSD with editable text layers
+PSD export is useful when you want to do final cleanup in Photoshop without rebuilding the page structure by hand.
+## Next steps
+- Learn export options: [Export Pages and Manage Projects](../how-to/export-and-manage-projects.md)
+- Compare runtime choices: [Acceleration and Runtime](../explanation/acceleration-and-runtime.md)
+- Choose a model: [Models and Providers](../explanation/models-and-providers.md)

zensical.toml ADDED Viewed

	@@ -0,0 +1,111 @@

+[project]
+site_name = "koharu"
+site_description = "ML-powered manga translator, written in Rust."
+site_author = "Mayo"
+site_url = "https://koharu.rs/"
+repo_url = "https://github.com/mayocream/koharu"
+repo_name = "mayocream/koharu"
+edit_uri = "edit/main/docs/"
+docs_dir = "docs"
+nav = [
+  {"Overview" = "index.md"},
+  {"Tutorials" = [
+    "tutorials/index.md",
+    "tutorials/translate-your-first-page.md",
+  ]},
+  {"How-To Guides" = [
+    "how-to/index.md",
+    "how-to/install-koharu.md",
+    "how-to/run-gui-headless-and-mcp.md",
+    "how-to/export-and-manage-projects.md",
+    "how-to/build-from-source.md",
+  ]},
+  {"Explanation" = [
+    "explanation/index.md",
+    "explanation/how-koharu-works.md",
+    "explanation/acceleration-and-runtime.md",
+    "explanation/models-and-providers.md",
+  ]},
+  {"Reference" = [
+    "reference/index.md",
+    "reference/cli.md",
+    "reference/keyboard-shortcuts.md",
+  ]},
+]
+[project.extra]
+generator = false
+[[project.extra.social]]
+icon = "fontawesome/brands/x-twitter"
+link = "https://x.com/mayo_irl"
+[[project.extra.social]]
+icon = "fontawesome/brands/discord"
+link = "https://discord.gg/mHvHkxGnUY"
+[project.theme]
+language = "en"
+logo = "assets/Koharu_Halo.png"
+favicon = "assets/Koharu_Halo.png"
+font.text = "Nunito"
+font.code = "Fira Code"
+features = [
+  "navigation.sections",
+  "navigation.indexes",
+  "navigation.instant",
+  "navigation.tracking",
+  "navigation.tabs",
+  "navigation.tabs.sticky",
+  "navigation.expand",
+  "navigation.footer",
+  "toc.follow",
+  "content.code.copy",
+  "content.action.edit",
+  "content.action.view",
+  "content.tabs.link",
+]
+[project.theme.icon]
+repo = "fontawesome/brands/github"
+[project.theme.icon.admonition]
+question = "fontawesome/solid/paper-plane"
+user = "lucide/user-round"
+[[project.theme.palette]]
+scheme = "default"
+primary = "pink"
+accent = "teal"
+toggle.icon = "lucide/sun"
+toggle.name = "Switch to dark mode"
+[[project.theme.palette]]
+scheme = "slate"
+primary = "pink"
+accent = "teal"
+toggle.icon = "lucide/moon"
+toggle.name = "Switch to light mode"
+[project.markdown_extensions.admonition]
+[project.markdown_extensions.attr_list]
+[project.markdown_extensions.md_in_html]
+[project.markdown_extensions.tables]
+[project.markdown_extensions.pymdownx.emoji]
+emoji_index = "zensical.extensions.emoji.twemoji"
+emoji_generator = "zensical.extensions.emoji.to_svg"
+[project.markdown_extensions.pymdownx.tabbed]
+alternate_style = true
+[project.markdown_extensions.pymdownx.tasklist]
+custom_checkbox = true
+[project.markdown_extensions.toc]
+permalink = true
+[project.markdown_extensions.pymdownx.superfences]
+custom_fences = [
+  { name = "mermaid", class = "mermaid", format = "pymdownx.superfences.fence_code_format" },
+]