<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://tts.wiki/index.php?action=history&amp;feed=atom&amp;title=NeuCodec</id>
	<title>NeuCodec - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://tts.wiki/index.php?action=history&amp;feed=atom&amp;title=NeuCodec"/>
	<link rel="alternate" type="text/html" href="https://tts.wiki/index.php?title=NeuCodec&amp;action=history"/>
	<updated>2026-04-03T18:40:26Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.41.5</generator>
	<entry>
		<id>https://tts.wiki/index.php?title=NeuCodec&amp;diff=57&amp;oldid=prev</id>
		<title>Ttswikiadmin: Created page with &quot;&#039;&#039;&#039;NeuCodec&#039;&#039;&#039; is a neural audio codec developed by Neuphonic, designed for efficient speech tokenization and high-quality audio compression at relatively low bitrates.  === Technical Specifications ===  * &#039;&#039;&#039;Bitrate:&#039;&#039;&#039; 0.8 kbps * &#039;&#039;&#039;Output sample rate:&#039;&#039;&#039; 24 kHz * &#039;&#039;&#039;Frame rate:&#039;&#039;&#039; 50 Hz * &#039;&#039;&#039;Quantization:&#039;&#039;&#039; Finite Scalar Quantization (FSQ) with a single codebook  === Architecture === NeuCodec is largely based on extending the work of X-Codec 2.0. It e...&quot;</title>
		<link rel="alternate" type="text/html" href="https://tts.wiki/index.php?title=NeuCodec&amp;diff=57&amp;oldid=prev"/>
		<updated>2025-12-23T02:33:27Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;&amp;#039;&amp;#039;&amp;#039;NeuCodec&amp;#039;&amp;#039;&amp;#039; is a neural audio codec developed by &lt;a href=&quot;/index.php?title=Neuphonic&amp;amp;action=edit&amp;amp;redlink=1&quot; class=&quot;new&quot; title=&quot;Neuphonic (page does not exist)&quot;&gt;Neuphonic&lt;/a&gt;, designed for efficient speech tokenization and high-quality audio compression at relatively low bitrates.  === Technical Specifications ===  * &amp;#039;&amp;#039;&amp;#039;Bitrate:&amp;#039;&amp;#039;&amp;#039; 0.8 kbps * &amp;#039;&amp;#039;&amp;#039;Output sample rate:&amp;#039;&amp;#039;&amp;#039; 24 kHz * &amp;#039;&amp;#039;&amp;#039;Frame rate:&amp;#039;&amp;#039;&amp;#039; 50 Hz * &amp;#039;&amp;#039;&amp;#039;Quantization:&amp;#039;&amp;#039;&amp;#039; Finite Scalar Quantization (FSQ) with a single codebook  === Architecture === NeuCodec is largely based on extending the work of &lt;a href=&quot;/index.php/X-Codec&quot; title=&quot;X-Codec&quot;&gt;X-Codec 2.0&lt;/a&gt;. It e...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;NeuCodec&amp;#039;&amp;#039;&amp;#039; is a neural audio codec developed by [[Neuphonic]], designed for efficient speech tokenization and high-quality audio compression at relatively low bitrates.&lt;br /&gt;
&lt;br /&gt;
=== Technical Specifications ===&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Bitrate:&amp;#039;&amp;#039;&amp;#039; 0.8 kbps&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Output sample rate:&amp;#039;&amp;#039;&amp;#039; 24 kHz&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Frame rate:&amp;#039;&amp;#039;&amp;#039; 50 Hz&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Quantization:&amp;#039;&amp;#039;&amp;#039; Finite Scalar Quantization (FSQ) with a single codebook&lt;br /&gt;
&lt;br /&gt;
=== Architecture ===&lt;br /&gt;
NeuCodec is largely based on extending the work of [[X-Codec|X-Codec 2.0]]. It employs a dual-encoder approach, using both audio ([[BigCodec]]) and semantic (Wav2Vec2-BERT) encoders. The FSQ-based design produces a single quantized vector per frame, making it well-suited for downstream Speech Language Model (SpeechLM) training.&lt;br /&gt;
&lt;br /&gt;
=== Features ===&lt;br /&gt;
&lt;br /&gt;
* Compresses and reconstructs audio with near-inaudible loss&lt;br /&gt;
* Takes 16 kHz input audio and reconstructs it at 24 kHz&lt;br /&gt;
* Commercial use permitted&lt;br /&gt;
* Pre-encoded datasets available (Emilia-YODAS compressed from 1.7 TB to 41 GB)&lt;br /&gt;
&lt;br /&gt;
=== Applications ===&lt;br /&gt;
NeuCodec serves as the audio codec for [[NeuTTS Air]], Neuphonic&amp;#039;s on-device text-to-speech model with voice cloning capabilities. It is intended for researchers and developers building text-to-speech systems who need efficient speech tokenization without developing their own codec.&lt;br /&gt;
&lt;br /&gt;
=== Availability ===&lt;br /&gt;
NeuCodec is available on Hugging Face and GitHub under the &amp;lt;code&amp;gt;neuphonic/neucodec&amp;lt;/code&amp;gt; repository and can be installed via pip.&lt;br /&gt;
&lt;br /&gt;
[[Category:Neural audio codecs]]&lt;/div&gt;</summary>
		<author><name>Ttswikiadmin</name></author>
	</entry>
</feed>