kazeia/scripts
Kazeia Team dafbe2a52b FULL NATIVE C++ TTS pipeline — any text, perfect quality
The complete solution for native TTS on NPU:
1. Python: tokenize + text_projection only (30ms, no model generation)
2. File: golden prefill[0:9] + text_proj + eos padding (ratio 3.5×)
3. C++ shared Module: codec_sum(our codes) + trailing text/eos/pad
4. RMS-based auto-trim of trailing noise after speech ends

Key insights:
- Shared Module C++ uses SAME QNN compiled graph as Java → self-consistent
- codec_sum from our NPU codes is coherent (same model instance)
- Text tokens consumed 1:1, then eos padding for remaining steps
- RMS trim detects 15% energy drop from peak → cuts garbage

Validated "impeccable" by user on "Bonjour, je m'appelle Kazeia..."
prepare_tts_native.py works for ANY text.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-09 23:39:06 +02:00
..
cp_et_runner.cpp Initial commit: Kazeia TTS pipeline on NPU via ExecuTorch 2026-04-09 08:42:11 +02:00
export_cp_pte.py Initial commit: Kazeia TTS pipeline on NPU via ExecuTorch 2026-04-09 08:42:11 +02:00
export_talker_pte.py Restore KV=100 + fix as-is embeds + multi-segment support 2026-04-09 22:26:20 +02:00
prepare_tts_embeds.py Add prepare_tts_embeds.py for any text + codec_sum fix 2026-04-09 14:05:42 +02:00
prepare_tts_native.py FULL NATIVE C++ TTS pipeline — any text, perfect quality 2026-04-09 23:39:06 +02:00
prepare_tts_segments.py Restore KV=100 + fix as-is embeds + multi-segment support 2026-04-09 22:26:20 +02:00
qc_schema_serialize_patched.py Initial commit: Kazeia TTS pipeline on NPU via ExecuTorch 2026-04-09 08:42:11 +02:00
test_cp_et_quality.py Initial commit: Kazeia TTS pipeline on NPU via ExecuTorch 2026-04-09 08:42:11 +02:00