痛点分析发布于 2026/05/31
痛点为 AI 基于上游原始证据的初步提炼;未包含额外中国市场检索。
痛点
在语音合成领域,现有TTS系统通常依赖分词器(tokenizer)进行文本到语音的转换,这会导致音质损失、多语言支持困难以及语音克隆不够自然。用户(如开发者、内容创作者)在生成多语言语音、设计创意声音或进行高保真语音克隆时,需要处理复杂的预处理流程、调整参数以弥补分词器带来的失真,且难以实现跨语言的流畅切换。这种技术瓶颈造成了开发效率低下、生成质量不稳定,以及创意声音设计上的限制,增加了迭代时间和试错成本。
README
README summary
<h2 align="center" VoxCPM2: Tokenizer Free TTS for Multilingual Speech Generation, Creative Voice Design, and True to Life Cloning</h2 <p align="center" <b English</b <a href="./README_zh.md" 中文</a
Repository
Repository metadata
- Owner
- OpenBMB
- License
- Apache-2.0
- Default branch
- main
- Days since created
- 257
- Days since last push
- 9
Signals
Repository signals
- Homepage
- https://voxcpm.com
- Watchers
- 23,421
- Open issues
- 107
Topics
audiodeeplearningminicpmmultilingualpythonpytorchspeechspeech-synthesistext-to-speechttstts-modelvoice-cloningvoice-designvoxcpm
Contributors
Contributor snapshot
- Contributor count
- 10
- Top contributor share
- 0.329
- Top contributors
- Labmem-Zhouyx, liuxin99, a710128, VoxInstruct, MuyleangIng
源数据· Raw Archive
- source
- GitHub Trending
- upstream_source
- github_trending
- upstream_item_id
- OpenBMB--VoxCPM
- daily_ranking_item_id
- 8c940d11-8c63-4db2-9a88-926b743614ce
- rank_date
- 2026-06-01
- rank
- 7
- name
- VoxCPM
- tagline
- VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
- description
- VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
- votes_count
- 23,421
- source_url
- https://github.com/OpenBMB/VoxCPM
- website_url
- https://voxcpm.com
- thumbnail_url
- https://github.com/OpenBMB.png
- og_image_url
- https://github.com/OpenBMB.png
topics
audiodeeplearningminicpmmultilingualpythonpytorchspeechspeech-synthesistext-to-speechttstts-modelvoice-cloningvoice-designvoxcpm
media / source-specific data
{
"topics": [
"audio",
"deeplearning",
"minicpm",
"multilingual",
"python",
"pytorch",
"speech",
"speech-synthesis",
"text-to-speech",
"tts",
"tts-model",
"voice-cloning",
"voice-design",
"voxcpm"
],
"is_fork": false,
"license": "Apache-2.0",
"language": "Python",
"owner_type": "Organization",
"forks_total": 2710,
"has_funding": false,
"is_archived": false,
"owner_login": "OpenBMB",
"stars_today": 639,
"stars_total": 23421,
"homepage_url": "https://voxcpm.com",
"default_branch": "main",
"last_pushed_at": "2026-05-22T03:27:10Z",
"readme_summary": "<h2 align=\"center\" VoxCPM2: Tokenizer Free TTS for Multilingual Speech Generation, Creative Voice Design, and True to Life Cloning</h2 <p align=\"center\" <b English</b <a href=\"./README_zh.md\" 中文</a",
"repo_full_name": "OpenBMB/VoxCPM",
"watchers_count": 23421,
"last_updated_at": "2026-05-31T21:58:21Z",
"top_contributors": [
{
"login": "Labmem-Zhouyx",
"contributions": 25
},
{
"login": "liuxin99",
"contributions": 13
},
{
"login": "a710128",
"contributions": 12
},
{
"login": "VoxInstruct",
"contributions": 11
},
{
"login": "MuyleangIng",
"contributions": 4
}
],
"contributor_count": 10,
"funding_platforms": [],
"open_issues_count": 107,
"days_since_created": 257,
"created_at_on_source": "2025-09-16T03:41:49Z",
"days_since_last_push": 9,
"top_contributor_share": 0.329
}raw_payload
{
"fetched_at": "2026-05-31T22:00:31.337Z",
"trending_repo": {
"url": "https://github.com/OpenBMB/VoxCPM",
"name": "VoxCPM",
"rank": 7,
"forks": 2710,
"owner": "OpenBMB",
"stars": 23421,
"fullName": "OpenBMB/VoxCPM",
"language": "Python",
"avatarUrl": "https://github.com/OpenBMB.png",
"rawSummary": "<div class=\"float-right d-flex\">\n\n <div data-view-component=\"true\" class=\"BtnGroup d-flex\">\n <a href=\"/login?return_to=%2FOpenBMB%2FVoxCPM\" rel=\"nofollow\" data-hydro-click=\"{"event_type":"authentication.click","payload":{"location_in_page":"star button","repository_id":1057664813,"auth_type":"LOG_IN","originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"a1fe12b5d683e0c15d327c1c8d150db53de98c3f97a3e53700969cdba86948bc\" aria-label=\"You must be signed in to star a repository\" data-view-component=\"true\" class=\"tooltipped tooltipped-sw btn-sm btn\"> <svg aria-hidden=\"true\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-star v-align-text-bottom d-none d-md-inline-block mr-2 tmp-mr-2\">\n <path d=\"M8 .25a.75.75 0 0 1 .673.418l1.882 3.815 4.21.612a.75.75 0 0 1 .416 1.279l-3.046 2.97.719 4.192a.751.751 0 0 1-1.088.791L8 12.347l-3.766 1.98a.75.75 0 0 1-1.088-.79l.72-4.194L.818 6.374a.75.75 0 0 1 .416-1.28l4.21-.611L7.327.668A.75.75 0 0 1 8 .25Zm0 2.445L6.615 5.5a.75.75 0 0 1-.564.41l-3.097.45 2.24 2.184a.75.75 0 0 1 .216.664l-.528 3.084 2.769-1.456a.75.75 0 0 1 .698 0l2.77 1.456-.53-3.084a.75.75 0 0 1 .216-.664l2.24-2.183-3.096-.45a.75.75 0 0 1-.564-.41L8 2.694Z\"></path>\n</svg><svg aria-hidden=\"true\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-star mr-0 tmp-mr-0 v-align-text-bottom d-inline-block d-md-none\">\n <path d=\"M8 .25a.75.75 0 0 1 .673.418l1.882 3.815 4.21.612a.75.75 0 0 1 .416 1.279l-3.046 2.97.719 4.192a.751.751 0 0 1-1.088.791L8 12.347l-3.766 1.98a.75.75 0 0 1-1.088-.79l.72-4.194L.818 6.374a.75.75 0 0 1 .416-1.28l4.21-.611L7.327.668A.75.75 0 0 1 8 .25Zm0 2.445L6.615 5.5a.75.75 0 0 1-.564.41l-3.097.45 2.24 2.184a.75.75 0 0 1 .216.664l-.528 3.084 2.769-1.456a.75.75 0 0 1 .698 0l2.77 1.456-.53-3.084a.75.75 0 0 1 .216-.664l2.24-2.183-3.096-.45a.75.75 0 0 1-.564-.41L8 2.694Z\"></path>\n</svg>\n <span data-view-component=\"true\" class=\"d-none d-md-inline\">\n Star\n</span>\n</a></div>\n </div>\n\n <h2 class=\"h3 lh-condensed\">\n <a data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"REPOSITORY","click_visual_representation":"REPOSITORY_NAME_HEADING","actor_id":null,"record_id":1057664813,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"a75e8ea62da569f4789f875caa24b61fde2bba68219914a6c2e748b447965576\" href=\"/OpenBMB/VoxCPM\" data-view-component=\"true\" class=\"Link\"><svg aria-hidden=\"true\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted\">\n <path d=\"M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z\"></path>\n</svg>\n\n <span data-view-component=\"true\" class=\"text-normal\">\n OpenBMB /\n</span>\n VoxCPM</a> </h2>\n\n <p class=\"col-9 color-fg-muted my-1 tmp-pr-4\">\n VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning\n </p>\n\n <div class=\"f6 color-fg-muted mt-2\">\n <span class=\"tmp-mr-3 d-inline-block ml-0 tmp-ml-0\">\n <span class=\"repo-language-color\" style=\"background-color: #3572A5\"></span>\n <span itemprop=\"programmingLanguage\">Python</span>\n</span>\n\n\n <a href=\"/OpenBMB/VoxCPM/stargazers\" data-view-component=\"true\" class=\"tmp-mr-3 Link Link--muted d-inline-block\"><svg aria-label=\"star\" role=\"img\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-star\">\n <path d=\"M8 .25a.75.75 0 0 1 .673.418l1.882 3.815 4.21.612a.75.75 0 0 1 .416 1.279l-3.046 2.97.719 4.192a.751.751 0 0 1-1.088.791L8 12.347l-3.766 1.98a.75.75 0 0 1-1.088-.79l.72-4.194L.818 6.374a.75.75 0 0 1 .416-1.28l4.21-.611L7.327.668A.75.75 0 0 1 8 .25Zm0 2.445L6.615 5.5a.75.75 0 0 1-.564.41l-3.097.45 2.24 2.184a.75.75 0 0 1 .216.664l-.528 3.084 2.769-1.456a.75.75 0 0 1 .698 0l2.77 1.456-.53-3.084a.75.75 0 0 1 .216-.664l2.24-2.183-3.096-.45a.75.75 0 0 1-.564-.41L8 2.694Z\"></path>\n</svg>\n 23,421</a>\n <a href=\"/OpenBMB/VoxCPM/forks\" data-view-component=\"true\" class=\"tmp-mr-3 Link Link--muted d-inline-block\"><svg aria-label=\"fork\" role=\"img\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-repo-forked\">\n <path d=\"M5 5.372v.878c0 .414.336.75.75.75h4.5a.75.75 0 0 0 .75-.75v-.878a2.25 2.25 0 1 1 1.5 0v.878a2.25 2.25 0 0 1-2.25 2.25h-1.5v2.128a2.251 2.251 0 1 1-1.5 0V8.5h-1.5A2.25 2.25 0 0 1 3.5 6.25v-.878a2.25 2.25 0 1 1 1.5 0ZM5 3.25a.75.75 0 1 0-1.5 0 .75.75 0 0 0 1.5 0Zm6.75.75a.75.75 0 1 0 0-1.5.75.75 0 0 0 0 1.5Zm-3 8.75a.75.75 0 1 0-1.5 0 .75.75 0 0 0 1.5 0Z\"></path>\n</svg>\n 2,710</a>\n <span data-view-component=\"true\" class=\"tmp-mr-3 d-inline-block\">\n Built by\n\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/Labmem-Zhouyx/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/Labmem-Zhouyx\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/63361337?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@Labmem-Zhouyx\" /></a>\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/a710128/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/a710128\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/8132992?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@a710128\" /></a>\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/liuxin99/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/liuxin99\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/33060143?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@liuxin99\" /></a>\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/VoxInstruct/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/VoxInstruct\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/167593451?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@VoxInstruct\" /></a>\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/SuperMarioYL/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/SuperMarioYL\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/20982600?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@SuperMarioYL\" /></a>\n</span>\n <span data-view-component=\"true\" class=\"d-inline-block float-sm-right\">\n <svg aria-hidden=\"true\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-star\">\n <path d=\"M8 .25a.75.75 0 0 1 .673.418l1.882 3.815 4.21.612a.75.75 0 0 1 .416 1.279l-3.046 2.97.719 4.192a.751.751 0 0 1-1.088.791L8 12.347l-3.766 1.98a.75.75 0 0 1-1.088-.79l.72-4.194L.818 6.374a.75.75 0 0 1 .416-1.28l4.21-.611L7.327.668A.75.75 0 0 1 8 .25Zm0 2.445L6.615 5.5a.75.75 0 0 1-.564.41l-3.097.45 2.24 2.184a.75.75 0 0 1 .216.664l-.528 3.084 2.769-1.456a.75.75 0 0 1 .698 0l2.77 1.456-.53-3.084a.75.75 0 0 1 .216-.664l2.24-2.183-3.096-.45a.75.75 0 0 1-.564-.41L8 2.694Z\"></path>\n</svg>\n 639 stars today\n</span> </div>",
"starsToday": 639,
"description": "VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning"
},
"snapshot_version": "github_trending_v2"
}source_raw_snapshot
{
"id": "29a441ec-9e0c-4c7b-897f-7f1486bec46d",
"daily_ranking_item_id": "8c940d11-8c63-4db2-9a88-926b743614ce",
"source": "github_trending",
"external_id": "OpenBMB--VoxCPM",
"fetched_at": "2026-05-31T22:00:31.337Z",
"trending_page_raw": {
"url": "https://github.com/OpenBMB/VoxCPM",
"name": "VoxCPM",
"rank": 7,
"forks": 2710,
"owner": "OpenBMB",
"stars": 23421,
"language": "Python",
"full_name": "OpenBMB/VoxCPM",
"avatar_url": "https://github.com/OpenBMB.png",
"description": "VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning",
"raw_summary": "<div class=\"float-right d-flex\">\n\n <div data-view-component=\"true\" class=\"BtnGroup d-flex\">\n <a href=\"/login?return_to=%2FOpenBMB%2FVoxCPM\" rel=\"nofollow\" data-hydro-click=\"{"event_type":"authentication.click","payload":{"location_in_page":"star button","repository_id":1057664813,"auth_type":"LOG_IN","originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"a1fe12b5d683e0c15d327c1c8d150db53de98c3f97a3e53700969cdba86948bc\" aria-label=\"You must be signed in to star a repository\" data-view-component=\"true\" class=\"tooltipped tooltipped-sw btn-sm btn\"> <svg aria-hidden=\"true\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-star v-align-text-bottom d-none d-md-inline-block mr-2 tmp-mr-2\">\n <path d=\"M8 .25a.75.75 0 0 1 .673.418l1.882 3.815 4.21.612a.75.75 0 0 1 .416 1.279l-3.046 2.97.719 4.192a.751.751 0 0 1-1.088.791L8 12.347l-3.766 1.98a.75.75 0 0 1-1.088-.79l.72-4.194L.818 6.374a.75.75 0 0 1 .416-1.28l4.21-.611L7.327.668A.75.75 0 0 1 8 .25Zm0 2.445L6.615 5.5a.75.75 0 0 1-.564.41l-3.097.45 2.24 2.184a.75.75 0 0 1 .216.664l-.528 3.084 2.769-1.456a.75.75 0 0 1 .698 0l2.77 1.456-.53-3.084a.75.75 0 0 1 .216-.664l2.24-2.183-3.096-.45a.75.75 0 0 1-.564-.41L8 2.694Z\"></path>\n</svg><svg aria-hidden=\"true\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-star mr-0 tmp-mr-0 v-align-text-bottom d-inline-block d-md-none\">\n <path d=\"M8 .25a.75.75 0 0 1 .673.418l1.882 3.815 4.21.612a.75.75 0 0 1 .416 1.279l-3.046 2.97.719 4.192a.751.751 0 0 1-1.088.791L8 12.347l-3.766 1.98a.75.75 0 0 1-1.088-.79l.72-4.194L.818 6.374a.75.75 0 0 1 .416-1.28l4.21-.611L7.327.668A.75.75 0 0 1 8 .25Zm0 2.445L6.615 5.5a.75.75 0 0 1-.564.41l-3.097.45 2.24 2.184a.75.75 0 0 1 .216.664l-.528 3.084 2.769-1.456a.75.75 0 0 1 .698 0l2.77 1.456-.53-3.084a.75.75 0 0 1 .216-.664l2.24-2.183-3.096-.45a.75.75 0 0 1-.564-.41L8 2.694Z\"></path>\n</svg>\n <span data-view-component=\"true\" class=\"d-none d-md-inline\">\n Star\n</span>\n</a></div>\n </div>\n\n <h2 class=\"h3 lh-condensed\">\n <a data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"REPOSITORY","click_visual_representation":"REPOSITORY_NAME_HEADING","actor_id":null,"record_id":1057664813,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"a75e8ea62da569f4789f875caa24b61fde2bba68219914a6c2e748b447965576\" href=\"/OpenBMB/VoxCPM\" data-view-component=\"true\" class=\"Link\"><svg aria-hidden=\"true\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted\">\n <path d=\"M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z\"></path>\n</svg>\n\n <span data-view-component=\"true\" class=\"text-normal\">\n OpenBMB /\n</span>\n VoxCPM</a> </h2>\n\n <p class=\"col-9 color-fg-muted my-1 tmp-pr-4\">\n VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning\n </p>\n\n <div class=\"f6 color-fg-muted mt-2\">\n <span class=\"tmp-mr-3 d-inline-block ml-0 tmp-ml-0\">\n <span class=\"repo-language-color\" style=\"background-color: #3572A5\"></span>\n <span itemprop=\"programmingLanguage\">Python</span>\n</span>\n\n\n <a href=\"/OpenBMB/VoxCPM/stargazers\" data-view-component=\"true\" class=\"tmp-mr-3 Link Link--muted d-inline-block\"><svg aria-label=\"star\" role=\"img\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-star\">\n <path d=\"M8 .25a.75.75 0 0 1 .673.418l1.882 3.815 4.21.612a.75.75 0 0 1 .416 1.279l-3.046 2.97.719 4.192a.751.751 0 0 1-1.088.791L8 12.347l-3.766 1.98a.75.75 0 0 1-1.088-.79l.72-4.194L.818 6.374a.75.75 0 0 1 .416-1.28l4.21-.611L7.327.668A.75.75 0 0 1 8 .25Zm0 2.445L6.615 5.5a.75.75 0 0 1-.564.41l-3.097.45 2.24 2.184a.75.75 0 0 1 .216.664l-.528 3.084 2.769-1.456a.75.75 0 0 1 .698 0l2.77 1.456-.53-3.084a.75.75 0 0 1 .216-.664l2.24-2.183-3.096-.45a.75.75 0 0 1-.564-.41L8 2.694Z\"></path>\n</svg>\n 23,421</a>\n <a href=\"/OpenBMB/VoxCPM/forks\" data-view-component=\"true\" class=\"tmp-mr-3 Link Link--muted d-inline-block\"><svg aria-label=\"fork\" role=\"img\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-repo-forked\">\n <path d=\"M5 5.372v.878c0 .414.336.75.75.75h4.5a.75.75 0 0 0 .75-.75v-.878a2.25 2.25 0 1 1 1.5 0v.878a2.25 2.25 0 0 1-2.25 2.25h-1.5v2.128a2.251 2.251 0 1 1-1.5 0V8.5h-1.5A2.25 2.25 0 0 1 3.5 6.25v-.878a2.25 2.25 0 1 1 1.5 0ZM5 3.25a.75.75 0 1 0-1.5 0 .75.75 0 0 0 1.5 0Zm6.75.75a.75.75 0 1 0 0-1.5.75.75 0 0 0 0 1.5Zm-3 8.75a.75.75 0 1 0-1.5 0 .75.75 0 0 0 1.5 0Z\"></path>\n</svg>\n 2,710</a>\n <span data-view-component=\"true\" class=\"tmp-mr-3 d-inline-block\">\n Built by\n\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/Labmem-Zhouyx/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/Labmem-Zhouyx\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/63361337?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@Labmem-Zhouyx\" /></a>\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/a710128/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/a710128\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/8132992?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@a710128\" /></a>\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/liuxin99/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/liuxin99\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/33060143?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@liuxin99\" /></a>\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/VoxInstruct/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/VoxInstruct\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/167593451?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@VoxInstruct\" /></a>\n <a class=\"d-inline-block\" data-hydro-click=\"{"event_type":"explore.click","payload":{"click_context":"TRENDING_REPOSITORIES_PAGE","click_target":"CONTRIBUTING_DEVELOPER","click_visual_representation":"DEVELOPER_AVATAR","actor_id":null,"record_id":null,"originating_url":"https://github.com/trending?since=daily","user_id":null}}\" data-hydro-click-hmac=\"1624b778276e789a4b952f4f024b178ca5826544f935e57298c1d5b6ad8c3665\" data-hovercard-type=\"user\" data-hovercard-url=\"/users/SuperMarioYL/hovercard\" data-octo-click=\"hovercard-link-click\" data-octo-dimensions=\"link_type:self\" href=\"/SuperMarioYL\"><img class=\"avatar mb-1 avatar-user\" src=\"https://avatars.githubusercontent.com/u/20982600?s=40&v=4\" width=\"20\" height=\"20\" alt=\"@SuperMarioYL\" /></a>\n</span>\n <span data-view-component=\"true\" class=\"d-inline-block float-sm-right\">\n <svg aria-hidden=\"true\" data-component=\"Octicon\" height=\"16\" viewBox=\"0 0 16 16\" version=\"1.1\" width=\"16\" data-view-component=\"true\" class=\"octicon octicon-star\">\n <path d=\"M8 .25a.75.75 0 0 1 .673.418l1.882 3.815 4.21.612a.75.75 0 0 1 .416 1.279l-3.046 2.97.719 4.192a.751.751 0 0 1-1.088.791L8 12.347l-3.766 1.98a.75.75 0 0 1-1.088-.79l.72-4.194L.818 6.374a.75.75 0 0 1 .416-1.28l4.21-.611L7.327.668A.75.75 0 0 1 8 .25Zm0 2.445L6.615 5.5a.75.75 0 0 1-.564.41l-3.097.45 2.24 2.184a.75.75 0 0 1 .216.664l-.528 3.084 2.769-1.456a.75.75 0 0 1 .698 0l2.77 1.456-.53-3.084a.75.75 0 0 1 .216-.664l2.24-2.183-3.096-.45a.75.75 0 0 1-.564-.41L8 2.694Z\"></path>\n</svg>\n 639 stars today\n</span> </div>",
"stars_today": 639
},
"repo_detail_raw": {
"id": 1057664813,
"url": "https://api.github.com/repos/OpenBMB/VoxCPM",
"fork": false,
"name": "VoxCPM",
"size": 3078,
"forks": 2710,
"owner": {
"id": 89920203,
"url": "https://api.github.com/users/OpenBMB",
"type": "Organization",
"login": "OpenBMB",
"node_id": "MDEyOk9yZ2FuaXphdGlvbjg5OTIwMjAz",
"html_url": "https://github.com/OpenBMB",
"gists_url": "https://api.github.com/users/OpenBMB/gists{/gist_id}",
"repos_url": "https://api.github.com/users/OpenBMB/repos",
"avatar_url": "https://avatars.githubusercontent.com/u/89920203?v=4",
"events_url": "https://api.github.com/users/OpenBMB/events{/privacy}",
"site_admin": false,
"gravatar_id": "",
"starred_url": "https://api.github.com/users/OpenBMB/starred{/owner}{/repo}",
"followers_url": "https://api.github.com/users/OpenBMB/followers",
"following_url": "https://api.github.com/users/OpenBMB/following{/other_user}",
"user_view_type": "public",
"organizations_url": "https://api.github.com/users/OpenBMB/orgs",
"subscriptions_url": "https://api.github.com/users/OpenBMB/subscriptions",
"received_events_url": "https://api.github.com/users/OpenBMB/received_events"
},
"topics": [
"audio",
"deeplearning",
"minicpm",
"multilingual",
"python",
"pytorch",
"speech",
"speech-synthesis",
"text-to-speech",
"tts",
"tts-model",
"voice-cloning",
"voice-design",
"voxcpm"
],
"git_url": "git://github.com/OpenBMB/VoxCPM.git",
"license": {
"key": "apache-2.0",
"url": "https://api.github.com/licenses/apache-2.0",
"name": "Apache License 2.0",
"node_id": "MDc6TGljZW5zZTI=",
"spdx_id": "Apache-2.0"
},
"node_id": "R_kgDOPwqvLQ",
"private": false,
"ssh_url": "git@github.com:OpenBMB/VoxCPM.git",
"svn_url": "https://github.com/OpenBMB/VoxCPM",
"archived": false,
"disabled": false,
"has_wiki": false,
"homepage": "https://voxcpm.com",
"html_url": "https://github.com/OpenBMB/VoxCPM",
"keys_url": "https://api.github.com/repos/OpenBMB/VoxCPM/keys{/key_id}",
"language": "Python",
"tags_url": "https://api.github.com/repos/OpenBMB/VoxCPM/tags",
"watchers": 23421,
"blobs_url": "https://api.github.com/repos/OpenBMB/VoxCPM/git/blobs{/sha}",
"clone_url": "https://github.com/OpenBMB/VoxCPM.git",
"forks_url": "https://api.github.com/repos/OpenBMB/VoxCPM/forks",
"full_name": "OpenBMB/VoxCPM",
"has_pages": false,
"hooks_url": "https://api.github.com/repos/OpenBMB/VoxCPM/hooks",
"pulls_url": "https://api.github.com/repos/OpenBMB/VoxCPM/pulls{/number}",
"pushed_at": "2026-05-22T03:27:10Z",
"teams_url": "https://api.github.com/repos/OpenBMB/VoxCPM/teams",
"trees_url": "https://api.github.com/repos/OpenBMB/VoxCPM/git/trees{/sha}",
"created_at": "2025-09-16T03:41:49Z",
"events_url": "https://api.github.com/repos/OpenBMB/VoxCPM/events",
"has_issues": true,
"issues_url": "https://api.github.com/repos/OpenBMB/VoxCPM/issues{/number}",
"labels_url": "https://api.github.com/repos/OpenBMB/VoxCPM/labels{/name}",
"merges_url": "https://api.github.com/repos/OpenBMB/VoxCPM/merges",
"mirror_url": null,
"updated_at": "2026-05-31T21:58:21Z",
"visibility": "public",
"archive_url": "https://api.github.com/repos/OpenBMB/VoxCPM/{archive_format}{/ref}",
"commits_url": "https://api.github.com/repos/OpenBMB/VoxCPM/commits{/sha}",
"compare_url": "https://api.github.com/repos/OpenBMB/VoxCPM/compare/{base}...{head}",
"description": "VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning",
"forks_count": 2710,
"is_template": false,
"open_issues": 107,
"branches_url": "https://api.github.com/repos/OpenBMB/VoxCPM/branches{/branch}",
"comments_url": "https://api.github.com/repos/OpenBMB/VoxCPM/comments{/number}",
"contents_url": "https://api.github.com/repos/OpenBMB/VoxCPM/contents/{+path}",
"git_refs_url": "https://api.github.com/repos/OpenBMB/VoxCPM/git/refs{/sha}",
"git_tags_url": "https://api.github.com/repos/OpenBMB/VoxCPM/git/tags{/sha}",
"has_projects": true,
"organization": {
"id": 89920203,
"url": "https://api.github.com/users/OpenBMB",
"type": "Organization",
"login": "OpenBMB",
"node_id": "MDEyOk9yZ2FuaXphdGlvbjg5OTIwMjAz",
"html_url": "https://github.com/OpenBMB",
"gists_url": "https://api.github.com/users/OpenBMB/gists{/gist_id}",
"repos_url": "https://api.github.com/users/OpenBMB/repos",
"avatar_url": "https://avatars.githubusercontent.com/u/89920203?v=4",
"events_url": "https://api.github.com/users/OpenBMB/events{/privacy}",
"site_admin": false,
"gravatar_id": "",
"starred_url": "https://api.github.com/users/OpenBMB/starred{/owner}{/repo}",
"followers_url": "https://api.github.com/users/OpenBMB/followers",
"following_url": "https://api.github.com/users/OpenBMB/following{/other_user}",
"user_view_type": "public",
"organizations_url": "https://api.github.com/users/OpenBMB/orgs",
"subscriptions_url": "https://api.github.com/users/OpenBMB/subscriptions",
"received_events_url": "https://api.github.com/users/OpenBMB/received_events"
},
"releases_url": "https://api.github.com/repos/OpenBMB/VoxCPM/releases{/id}",
"statuses_url": "https://api.github.com/repos/OpenBMB/VoxCPM/statuses/{sha}",
"allow_forking": true,
"assignees_url": "https://api.github.com/repos/OpenBMB/VoxCPM/assignees{/user}",
"downloads_url": "https://api.github.com/repos/OpenBMB/VoxCPM/downloads",
"has_downloads": true,
"languages_url": "https://api.github.com/repos/OpenBMB/VoxCPM/languages",
"network_count": 2710,
"default_branch": "main",
"milestones_url": "https://api.github.com/repos/OpenBMB/VoxCPM/milestones{/number}",
"stargazers_url": "https://api.github.com/repos/OpenBMB/VoxCPM/stargazers",
"watchers_count": 23421,
"deployments_url": "https://api.github.com/repos/OpenBMB/VoxCPM/deployments",
"git_commits_url": "https://api.github.com/repos/OpenBMB/VoxCPM/git/commits{/sha}",
"has_discussions": false,
"subscribers_url": "https://api.github.com/repos/OpenBMB/VoxCPM/subscribers",
"contributors_url": "https://api.github.com/repos/OpenBMB/VoxCPM/contributors",
"issue_events_url": "https://api.github.com/repos/OpenBMB/VoxCPM/issues/events{/number}",
"stargazers_count": 23421,
"subscription_url": "https://api.github.com/repos/OpenBMB/VoxCPM/subscription",
"temp_clone_token": null,
"collaborators_url": "https://api.github.com/repos/OpenBMB/VoxCPM/collaborators{/collaborator}",
"custom_properties": {},
"has_pull_requests": true,
"issue_comment_url": "https://api.github.com/repos/OpenBMB/VoxCPM/issues/comments{/number}",
"notifications_url": "https://api.github.com/repos/OpenBMB/VoxCPM/notifications{?since,all,participating}",
"open_issues_count": 107,
"subscribers_count": 118,
"web_commit_signoff_required": false,
"pull_request_creation_policy": "all"
},
"readme_raw": {
"summary": "<h2 align=\"center\" VoxCPM2: Tokenizer Free TTS for Multilingual Speech Generation, Creative Voice Design, and True to Life Cloning</h2 <p align=\"center\" <b English</b <a href=\"./README_zh.md\" 中文</a",
"raw_text": "<h2 align=\"center\">VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning</h2>\n\n<p align=\"center\">\n <b>English</b> | <a href=\"./README_zh.md\">中文</a>\n</p>\n\n<p align=\"center\">\n <a href=\"https://github.com/OpenBMB/VoxCPM/\"><img src=\"https://img.shields.io/badge/Project%20Page-GitHub-blue\" alt=\"Project Page\"></a>\n <a href=\"https://huggingface.co/spaces/OpenBMB/VoxCPM-Demo\"><img src=\"https://img.shields.io/badge/Live%20Playground-Demo-orange\" alt=\"Live Playground\"></a>\n <a href=\"https://voxcpm.readthedocs.io/en/latest/\"><img src=\"https://img.shields.io/badge/Docs-ReadTheDocs-8CA1AF\" alt=\"Documentation\"></a>\n <a href=\"https://huggingface.co/openbmb/VoxCPM2\"><img src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-VoxCPM2-yellow\" alt=\"Hugging Face\"></a>\n <a href=\"https://modelscope.cn/models/OpenBMB/VoxCPM2\"><img src=\"https://img.shields.io/badge/ModelScope-VoxCPM2-purple\" alt=\"ModelScope\"></a>\n <a href=\"https://openbmb.github.io/voxcpm2-demopage/\"><img src=\"https://img.shields.io/badge/DemoPage-Audio Samples-red\"></a>\n \n</p>\n\n<div align=\"center\">\n <img src=\"assets/voxcpm_logo.png\" alt=\"VoxCPM Logo\" width=\"35%\">\n <br><br>\n <a href=\"https://trendshift.io/repositories/17704\" target=\"_blank\"><img src=\"https://trendshift.io/api/badge/repositories/17704\" alt=\"OpenBMB%2FVoxCPM | Trendshift\" style=\"width: 250px; height: 55px;\" width=\"250\" height=\"55\"/></a>\n</div>\n\n<br>\n\n<p align=\"center\">\n 👋 Join our community for discussion and support!\n <br>\n <a href=\"./assets/feishu-group.png\" style=\"display:inline-block;vertical-align:middle; margin-left: 10px;\">\n <img src=\"./assets/feishu-logo.png\" width=\"16\" height=\"16\" style=\"vertical-align:middle;\"> Feishu\n </a>\n | \n <a href=\"https://discord.gg/KZUx7tVNwz\" style=\"display:inline-block;vertical-align:middle;\">\n <img src=\"./assets/discord-logo.png\" width=\"16\" height=\"16\" style=\"vertical-align:middle;\"> Discord\n </a>\n</p>\n\nVoxCPM is a **to",
"raw_text_truncated": true
},
"contributors_raw": {
"truncated": true,
"contributors": [
{
"type": "User",
"login": "Labmem-Zhouyx",
"html_url": "https://github.com/Labmem-Zhouyx",
"contributions": 25
},
{
"type": "User",
"login": "liuxin99",
"html_url": "https://github.com/liuxin99",
"contributions": 13
},
{
"type": "User",
"login": "a710128",
"html_url": "https://github.com/a710128",
"contributions": 12
},
{
"type": "User",
"login": "VoxInstruct",
"html_url": "https://github.com/VoxInstruct",
"contributions": 11
},
{
"type": "User",
"login": "MuyleangIng",
"html_url": "https://github.com/MuyleangIng",
"contributions": 4
},
{
"type": "User",
"login": "SuperMarioYL",
"html_url": "https://github.com/SuperMarioYL",
"contributions": 4
},
{
"type": "User",
"login": "AbrahamSanders",
"html_url": "https://github.com/AbrahamSanders",
"contributions": 2
},
{
"type": "User",
"login": "ZMXJJ",
"html_url": "https://github.com/ZMXJJ",
"contributions": 2
},
{
"type": "User",
"login": "Oumnya",
"html_url": "https://github.com/Oumnya",
"contributions": 2
},
{
"type": "User",
"login": "kuishou68",
"html_url": "https://github.com/kuishou68",
"contributions": 1
}
]
},
"funding_raw": {
"path": null,
"exists": false,
"content": null
},
"stats_raw": {
"forks_total": 2710,
"stars_today": 639,
"stars_total": 23421,
"watchers_count": 23421,
"open_issues_count": 107
},
"aux_raw": {
"selected_fields": {
"topics": [
"audio",
"deeplearning",
"minicpm",
"multilingual",
"python",
"pytorch",
"speech",
"speech-synthesis",
"text-to-speech",
"tts",
"tts-model",
"voice-cloning",
"voice-design",
"voxcpm"
],
"is_fork": false,
"license": "Apache-2.0",
"language": "Python",
"owner_type": "Organization",
"forks_total": 2710,
"has_funding": false,
"is_archived": false,
"owner_login": "OpenBMB",
"stars_today": 639,
"stars_total": 23421,
"homepage_url": "https://voxcpm.com",
"default_branch": "main",
"last_pushed_at": "2026-05-22T03:27:10Z",
"readme_summary": "<h2 align=\"center\" VoxCPM2: Tokenizer Free TTS for Multilingual Speech Generation, Creative Voice Design, and True to Life Cloning</h2 <p align=\"center\" <b English</b <a href=\"./README_zh.md\" 中文</a",
"repo_full_name": "OpenBMB/VoxCPM",
"watchers_count": 23421,
"last_updated_at": "2026-05-31T21:58:21Z",
"top_contributors": [
{
"login": "Labmem-Zhouyx",
"contributions": 25
},
{
"login": "liuxin99",
"contributions": 13
},
{
"login": "a710128",
"contributions": 12
},
{
"login": "VoxInstruct",
"contributions": 11
},
{
"login": "MuyleangIng",
"contributions": 4
}
],
"contributor_count": 10,
"funding_platforms": [],
"open_issues_count": 107,
"days_since_created": 257,
"created_at_on_source": "2025-09-16T03:41:49Z",
"days_since_last_push": 9,
"top_contributor_share": 0.329
}
},
"selection_meta": {
"readme_status": "ok",
"funding_status": "ok",
"missing_enrichment": [],
"repo_detail_status": "ok",
"contributors_status": "ok"
},
"created_at": "2026-05-31T22:00:33.245Z",
"updated_at": "2026-05-31T22:00:33.245Z"
}