mrfakename's picture

mrfakename PRO

mrfakename

·

https://mrfake.name/

AI & ML interests

LLMs, TTS, & Open Source

Articles

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

Organizations

mrfakename's activity

upvoted 2 papers 14 days ago

MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer

Paper • 2409.00750 • Published Sep 1 • 2

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

Paper • 2409.10058 • Published Sep 16 • 1

upvoted a paper 27 days ago

YODAS: Youtube-Oriented Dataset for Audio and Speech

Paper • 2406.00899 • Published Jun 2 • 2

upvoted a paper 28 days ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9 • 40

upvoted 2 papers 3 months ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15 • 55

Artist: Aesthetically Controllable Text-Driven Stylization without Training

Paper • 2407.15842 • Published Jul 22 • 13

upvoted a paper 5 months ago

Diffusion On Syntax Trees For Program Synthesis

Paper • 2405.20519 • Published May 30 • 1

upvoted 2 papers 6 months ago

"Teach AI How to Code": Using Large Language Models as Teachable Agents for Programming Education

Paper • 2309.14534 • Published Sep 25, 2023 • 2

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15 • 20

upvoted a collection 6 months ago

🎭 Avatars

The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 69 items • Updated 20 days ago • 76

upvoted an article 7 months ago

Article

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Jan 26, 2023

• 39

upvoted 2 papers 7 months ago

Better speech synthesis through scaling

Paper • 2305.07243 • Published May 12, 2023 • 5

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 251

upvoted 2 articles 7 months ago

Article

Mixture of Depth is Vibe

By

•

Apr 22

• 44

Article

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

Feb 27

• 34

upvoted 2 papers 7 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 84

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Paper • 2402.01912 • Published Feb 2 • 11

upvoted an article 7 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 183

upvoted 2 papers 7 months ago

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 25

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26 • 78