๐ต๐ฐ ุงุฑุฏู | ๐บ๐ธ English | ๐จ๐ณ Chinese | ๐ธ๐ฆ Arabic | ๐ฎ๐ณ Hindi | ๐ซ๐ท French | ๐ Hello
NZG73 Software Documentation & User Guidelines
Developer: Muhammad Noman | Company: NZG73
โ ๏ธ Strict Warning Against Misuse
๐จ NZG73 strictly instructs all users to use this software only for positive and constructive purposes.
Misusing someone's voice, targeting individuals, or causing harm to anyone's life is highly unethical and a grave offense.
We must fear the consequences of such actions and avoid them.
This tool is designed to simplify complex tasks; please use it responsibly. ๐ค
๐ Key Features
The system is divided into several advanced modules to ensure optimal performance for every task. Detailed descriptions of each feature are provided below:
๐ Step 1: TTS CPU (Text-to-Speech)
This module is specifically designed for users who do not have expensive Graphics Cards (GPUs).
| ๐ท๏ธ Feature | ๐ Description |
|---|---|
| ๐ Supported Languages | Primarily supports 2 languages (English and Chinese) |
| โก Performance | Runs smoothly on a CPU without any lag. No heavy GPU needed. |
| ๐๏ธ Voice Cloning | Includes a highly functional Voice Cloning feature (Strictly for ethical use only) |
๐ค Step 2: Voice Clone (Advanced)
An extremely powerful Voice Cloning module supporting a vast range of languages.
| ๐ท๏ธ Feature | ๐ Description |
|---|---|
| ๐ Supported Languages | Supports 600 different languages ๐ |
| ๐ป Hardware Requirements | Due to its heavy and advanced nature, it is very slow on a CPU |
| ๐ Performance | A GPU is mandatory for best results. Runs easily on cards with 4GB to 8GB VRAM. The more powerful the GPU, the faster the processing speed. |
๐ Step 3: Voice Clone Metadata (Special Feature)
๐ This is the most specialized and advanced part of the software!
Previously, users had to record audio and create text files separately for Voice Trainingโa process that took hours.
โจ "Dataset preparation is now easier than ever!" โจ
Instead of manually cutting hundreds of clips and naming them, this Automated System handles everything. You simply:
- โ๏ธ Enter your text
- ๐ฑ๏ธ Click a button
- ๐ง The system generates the voice
- ๐ Saves it in a numbered sequence in a folder
- ๐ Automatically updates your metadata file
๐ฏ Training a dataset is now child's play! ๐ง
๐ Step 4: Speech to Text (ASR)
This module converts audio into text with incredible accuracy.
| ๐ท๏ธ Feature | ๐ Description |
|---|---|
| ๐ Supported Languages | Supports 99 languages |
| ๐ป Performance & Hardware | This is a powerful model requiring at least a 3GB VRAM GPU. While it works excellently on a GPU, it is nearly impossible to use on a CPU due to extremely slow speeds. |
๐ Step 5: Voice to Voice
This feature converts one voice directly into another.
๐ง How it works:
Suppose you generated a story or audio using an AI model (Target Voice), but you want it to sound like your own voice. You provide:
- ๐ฏ The Target Voice (AI generated audio)
- ๐๏ธ A 30-second reference clip of your Original Voice
After clicking Generate, the entire audio is transformed into your voice! ๐
โก Performance: Somewhat slow on a CPU but functional. However, on a GPU, it processes with fluid speed. ๐
๐ Upcoming Releases
The journey of NZG73 doesn't end here. We are bringing you more exciting updates:
๐ฑ N Preva (Mobile App)
Our mobile app, N Preva, will be released soon, bringing powerful AI features directly to your smartphone.
๐ Advanced Web UI
A new, advanced Web UI is in development. It will support multiple AI models, allowing you to interact with various AI systems and handle different tasks simultaneously.
๐ ๏ธ Powered By ๐ ๏ธ
๐ Connect With Me ๐
๐ GitHub Stats ๐
โจ Made with โค๏ธ by Muhammad Noman | NZG73 โจ
โญ If you like this project, don't forget to give it a star! โญ