🕵️ NeMo Anonymizer: Detect and protect PII through context-aware replacement and rewriting
Updated Jul 3, 2026 - Python
An awesome collection of NVIDIA NemoClaw presets, recipes, and playbooks for sandboxed OpenClaw operations.
Speech synthesis (TTS) in low-resource languages by training FastPitch from scratch and fine-tuning HiFi-GAN
Free AI & Community powered Learning Experience
Nemotron ASR rewrite to GGML
LLM tutorial materials, including but not limited to NVIDIA NeMo, TensorRT-LLM, Triton Inference Server, and NeMo Guardrails.
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine
This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach to speaker diarization from NVIDIA.
Post-training quantization of an NVIDIA NeMo ASR model
Extractive Question-Answering with BERT on SQuAD v2.0 (Stanford Question Answering Dataset) using NVIDIA PyTorch Lightning
📄 SmartSRT is a command-line tool for generating accurate subtitles with per-word timestamps. It uses WhisperAI for speech transcription, NVIDIA NeMo for diarization, and OpenCV for face recognition. The program is good at creating high-accuracy subtitles. 🎧💻⚙️
Automatic transcriber built with the NVIDIA NeMo AI toolkit. It transcribes speech to text in real time from any source. It requires a CUDA-capable GPU on the local machine. When set up with virtual audio cables, it can transcribe audio as it is being played, with no other requirements.
The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `Nemo`.
Extremely fast and accurate audio transcriber surpassing Whisper. Fast on GPU or CPU.
🤖 NemoClawd — NVIDIA NeMo Agent Toolkit + ClawdBot in one bridge-first AI workspace. TypeScript orchestrates NeMo Python workflows. Enterprise AI Digital Project Manager.
Multi-agent reference app combining NVIDIA NeMo (Python) and Microsoft Agent Framework (.NET) over A2A, with a Razor web chat UI and Aspire-based orchestration.
Swift library for Speaker Embedding extraction and verification using NVIDIA NeMo TitaNet model converted to CoreML. Extract 192-dim speaker embeddings, verify speakers, and perform real-time speaker diarization on iOS/macOS.
Swift library for Voice Activity Detection (VAD) using NVIDIA NeMo MarbleNet model converted to CoreML. Detect speech segments in real-time on iOS/macOS with high accuracy and low latency.
This bootcamp is designed to give NLP researchers an end-to-end overview of the fundamentals of the NVIDIA NeMo framework, a complete solution for building large language models. It also includes hands-on exercises complemented by tutorials, code snippets, and presentations to help researchers get started with NeMo LLM Service and Guardrails.
Local-first developer dashboard for the NVIDIA DGX Spark.