OpenMOSS-Team/MOSS-Audio-Tokenizer-Nano · CNN-free causal transformer · 12.5 Hz frame rate · 32-layer RVQ · 3M hrs training data
OpenMOSS-Team/MOSS-Audio-Tokenizer-Nano