-
db281002d9
scripts: export per-voice prefix/suffix embeddings
main
Kazeia Team
2026-04-15 00:09:23 +0200
-
c2f7859dfe
UI+TTS: voice hot-swap + typing dots + emoji stripping
Kazeia Team
2026-04-14 23:55:07 +0200
-
b5b13780f7
UI: whole-sphere Fourier-mode deformation during speech
Kazeia Team
2026-04-14 23:47:30 +0200
-
2fe46e0f15
Fix seg-2 audio dropout + switch spectrum from bars to Bézier lines
Kazeia Team
2026-04-14 23:42:43 +0200
-
06dcd76dcb
UI: large central orb w/ spectrum-inside + per-voice palette
Kazeia Team
2026-04-14 23:33:38 +0200
-
8939c680b2
UI: minimalist audio-reactive orb visualizer, replaces 3D avatar for MVP
Kazeia Team
2026-04-14 23:20:15 +0200
-
f17131aefb
UI: reveal Kazeia reply in sync with TTS audio (per-sentence, per-word)
Kazeia Team
2026-04-14 22:58:18 +0200
-
6a958c1a10
Revert MemoryOptimizer — reclaim wasn't worth the footprint
Kazeia Team
2026-04-14 22:46:58 +0200
-
751e3e0868
memory: periodic sweep + expand kill list (photos, calendar, contacts, vending, tachyon…)
Kazeia Team
2026-04-14 22:44:09 +0200
-
39babcb158
TTS+audio+memory: ColorOS playback fixes + kill-background reclaim
Kazeia Team
2026-04-14 22:37:20 +0200
-
0632db1ee0
UI: drop Magisk prompt — ResourceMonitor stops probing su
Kazeia Team
2026-04-14 18:35:18 +0200
-
10fd10fd90
TTS: overlap CP↔BigVGAN — first audio 14.5s → 10.9s per segment
Kazeia Team
2026-04-14 16:22:15 +0200
-
67de8d4767
LLM: enable hybrid-mode export via num_sharding=1 — TTFT 2.9s → 113ms
Kazeia Team
2026-04-14 15:08:31 +0200
-
a41619ed67
TTS: keep BigVGAN on CPU after GPU regression; LLM filter strips more tags
Kazeia Team
2026-04-14 13:48:37 +0200
-
f4b15a72a7
LLM JNI: auto-detect eval_mode from .pte methods (kv-only vs hybrid)
Kazeia Team
2026-04-14 12:45:10 +0200
-
3d435f9cdd
LLM: trim system prompt to drop ~27 prefill tokens (-1.3s TTFT)
Kazeia Team
2026-04-14 12:16:11 +0200
-
7dc6704e95
docs: add before/after performance comparison to no-root report
Kazeia Team
2026-04-14 11:37:15 +0200
-
6c7746c5d0
docs: add post-mortem to no-root report — issue resolved
Kazeia Team
2026-04-14 11:19:27 +0200
-
b57719fa5e
LLM: filter <think> tokens out of the streaming TTS path
Kazeia Team
2026-04-14 11:16:08 +0200
-
f32b5ddfdd
LLM no-root: validate end-to-end pipeline, fix kv_io_bit_width detection
Kazeia Team
2026-04-14 11:11:23 +0200
-
809a6d4fed
LLM no-root: migrate to in-process LlmModule (JNI) — zero su calls
Kazeia Team
2026-04-14 10:39:50 +0200
-
6e6a2d9f82
Baseline before no-root migration: working state with root LLM
backup/pre-no-root-migration
Kazeia Team
2026-04-14 08:19:36 +0200
-
364016b7b8
LLM+TTS: short-response system prompt, PTE streaming fallback
Kazeia Team
2026-04-14 00:17:08 +0200
-
9930bfa392
LLM: enable Qwen3-4B NPU (21 tok/s) in service pipeline
Kazeia Team
2026-04-13 23:00:25 +0200
-
19f934af25
LLM NPU: Qwen3-4B QNN export patches + deployment notes
Kazeia Team
2026-04-13 22:56:42 +0200
-
f548e02283
TTS: dynamic EOS-rank boost terminates generation cleanly across voices
Kazeia Team
2026-04-13 14:13:04 +0200
-
c25040a780
TTS: conditional tail-trim + export script accepts voice path arg
Kazeia Team
2026-04-13 11:32:33 +0200
-
0833d1bd21
TTS: route all synthesizeAndPlay calls through Stage 3 streaming session
Kazeia Team
2026-04-13 11:12:14 +0200
-
2f07901ff3
TTS Stage 3: LLM stream → sentence split → TTS session → shared AudioTrack
Kazeia Team
2026-04-13 10:52:46 +0200
-
7f1a44c23d
TTS Stage 2: on-device voice-cloning TTS for arbitrary text
Kazeia Team
2026-04-13 10:12:09 +0200
-
5e416713ce
TTS Stage 1 streaming: play each segment the moment it's decoded
Kazeia Team
2026-04-13 08:43:30 +0200
-
de878ddf5c
TTS tremor investigation: identify cross-arch numerical floor, gate diag flags
Kazeia Team
2026-04-13 00:15:14 +0200
-
ee186e9049
Auto-segmentation for long texts + dynamic pipeline
Kazeia Team
2026-04-10 00:08:59 +0200
-
199bc4fbc9
Full native C++ TTS validated on short + long phrases
Kazeia Team
2026-04-09 23:51:05 +0200
-
dafbe2a52b
FULL NATIVE C++ TTS pipeline — any text, perfect quality
Kazeia Team
2026-04-09 23:39:06 +0200
-
09d36f2025
Root cause found + on-device embed capture + KV=100 restored
Kazeia Team
2026-04-09 23:00:37 +0200
-
3dcf73aa38
Restore KV=100 + fix as-is embeds + multi-segment support
Kazeia Team
2026-04-09 22:26:20 +0200
-
10a3904d7d
Multi-segment TTS for long text: split → generate → concatenate
Kazeia Team
2026-04-09 14:34:05 +0200
-
24157c0a68
Fix: use pre-computed embeds as-is (no double codec_sum)
Kazeia Team
2026-04-09 14:10:23 +0200
-
f6df1738c5
Add prepare_tts_embeds.py for any text + codec_sum fix
Kazeia Team
2026-04-09 14:05:42 +0200
-
173606dae7
Stable: decoder 8T optimization + restore pre-computed embeds
Kazeia Team
2026-04-09 13:42:02 +0200
-
42bbb96fd8
Optimize decoder: BigVGAN 8T, small models 4T → RTF 1.26
Kazeia Team
2026-04-09 13:00:05 +0200
-
a688edc9ec
Reduce talker KV_LEN 100→64: saves 148ms (RTF 1.31)
Kazeia Team
2026-04-09 12:47:30 +0200
-
4dcc4bb8b3
Fix KV buffer + revert HTP decoder (BigVGAN too complex for HTP)
Kazeia Team
2026-04-09 12:37:50 +0200
-
985fd9cff9
Direct output→input KV copy: RTF 1.51 → 1.31
Kazeia Team
2026-04-09 12:23:45 +0200
-
14f7e5b05f
Optimize CP+talker: eliminate prepare_input_tensors per step
Kazeia Team
2026-04-09 12:16:38 +0200
-
e647911329
Shared Module C++ pipeline: RTF 1.6 with perfect quality
Kazeia Team
2026-04-09 12:05:58 +0200
-
38c0e9874a
Disable C++ pipeline (QNN non-deterministic), keep Java RTF 1.8
Kazeia Team
2026-04-09 11:42:49 +0200
-
439629c9bf
Revert "Pre-allocate Tensor/EValue in Java pipeline: 16s → 8.9s (RTF 1.9)"
Kazeia Team
2026-04-09 11:03:52 +0200
-
0f027c5fde
Pre-allocate Tensor/EValue in Java pipeline: 16s → 8.9s (RTF 1.9)
Kazeia Team
2026-04-09 10:59:13 +0200
-
8e536094df
Fix C++ pipeline eos/pad + disable for quality (keep Java default)
Kazeia Team
2026-04-09 10:53:19 +0200
-
3b01302cfb
Fix missing eos/pad embeddings in native C++ pipeline
Kazeia Team
2026-04-09 10:35:05 +0200
-
393ce79eb5
Native C++ pipeline: RTF 1.4 (was 3.6 in Java)
Kazeia Team
2026-04-09 10:09:32 +0200
-
fb6045a635
Pre-load CP heads + GPU decoder test (reverted) + headArgmaxOffset
Kazeia Team
2026-04-09 09:57:01 +0200
-
6e6c562d53
Add DSP warmup + fix pipeline thread contention
Kazeia Team
2026-04-09 09:24:18 +0200
-
8bfe6c7445
Add NEON SIMD heads argmax for CP — 2.3× speedup
Kazeia Team
2026-04-09 08:55:20 +0200
-
389ffa7c61
Initial commit: Kazeia TTS pipeline on NPU via ExecuTorch
Kazeia Team
2026-04-09 08:42:11 +0200