Qwen3-ForcedAligner-0.6B (Q4 ONNX) β Trackdub Mirror
Q4 ONNX export of Qwen/Qwen3-ForcedAligner-0.6B, sourced from aloomba/Qwen3-ForcedAligner-0.6B-ONNX.
Hosted here for stable, checksum-verified distribution by Trackdub (local-first AI dubbing workstation).
Model
- Architecture: Encoder (24L, d=1024) + 28L decoder body, non-autoregressive
- Output: Word-level timestamps via slot injection; 5000-class softmax Γ 80ms resolution
- Max audio duration: 5 minutes
- Inference: Single forward pass
- Languages: 11 (English, Chinese, Cantonese, French, German, Italian, Japanese, Korean, Portuguese, Russian, Spanish)
Files
| File | Size | SHA256 |
|---|---|---|
| onnx/model_q4.onnx | ~1000 MB | 59b528896d70b34e57838e160d16d5f7cfc02d86c7c6ad46cdc57c25c15497b7 |
| config.json | β | β |
| okenizer.json | β | β |
| ocab.json | β | β |
| merges.txt | β | β |
| preprocessor_config.json | β | β |
| okenizer_config.json | β | β |
| special_tokens_map.json | β | β |
| dded_tokens.json | β | β |
Note: model_q4.onnx is self-contained (no external data sidecar required).
Usage in Trackdub
Used as the primary forced alignment provider in Trackdub's lip-sync stage (ngine_family: onnx-qwen-forced-aligner). Not intended for standalone use from this mirror.
License
Apache 2.0 β see original model card.
- Downloads last month
- 17
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support