
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
Show notes
Hosted by Kana & Mari · 🇺🇸 US · JA · 101 episodes
Established thought leaders with verified media credentials.
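Kana and Mari introduce, in their own voices, noteworthy GitHub repositories related to "sound," such as TTS, MIDI, and audio. They offer a light-footed tour of the open-source world where sound and code intersect. Kana and Mari's profiles: Kana: Newbie Esports Caster; Mari: Newbie Esports Analyst. Note: the scripts for this show are automatically generated using generative AI. The content may contain errors, so please enjoy it as reference information only.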
Kana & Mari host Kana & Mari’s SoundRepos, a technology show with 101 published episodes.

One-shot voice cloning based on Unet-TTS
Show notes
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Show notes
A ComfyUI custom node for GPT-SoVITS: you can now do voice cloning and TTS in ComfyUI
Show notes
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Show notes
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and natural interruption h
Show notes
The hub for audio AI research: papers, open models, benchmarks & datasets across audio LLMs, speech recognition, TTS, music & audio generation.
Show notes
TTS-Story is a web-based multi‑voice TTS studio for turning tagged scripts into audiobooks—featuring full speaker management, chunk review/regeneration, a job queue and library system, and local GPU or API backends inclu
Show notes
[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
Show notes

Official implementation of Meta-StyleSpeech and StyleSpeech
Show notes

gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.
Show notes

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
Show notes
An AstrBot plugin that lets a Bot send proactive messages in private and group chats, featuring context awareness, persistent data, dynamic emotions, do-not-disturb periods, and TTS integration. It also has a standalone WebUI for personalized configuration.
Show notes
Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiobooks, and GPU acceleration.
Show notes
Kana & Mari’s SoundRepos has a verified contact on file. Research at least 3 recent episodes first and lead with a specific angle that serves their technology audience.
Kana & Mari’s SoundRepos is hosted by Kana & Mari. The show is categorised under technology and has published 101 episodes.
Kana & Mari’s SoundRepos is accessible for guests with genuine technology expertise. A personalised, episode-aware pitch will still outperform a generic one every time.
Kana & Mari’s SoundRepos hasn't explicitly signalled guest openness in recent episodes. That doesn't rule out pitching; your hook just needs to be especially compelling and relevant to their recent content.
Episodes of Kana & Mari’s SoundRepos average 2 minutes, a focused format where a clear narrative arc and tight preparation matter most.
Our data rates Kana & Mari’s SoundRepos's guest bar at 80/100 (Premium tier: established thought leaders with verified media credentials).
Methodology. Booking Probability™ blends Listen Score, 30-day Virality, open-to-guests detection, and Apple ratings. Data refreshed every 60 minutes. Listen Score and Booking Probability are calculated by PitchCentric. Last enriched 8 days ago.