搜索 WebRTC泄露相关的推文与用户

2026.05.27 22:50

开了梯子还在裸奔？问题在 WebRTC 泄露 😱 去测一下→看到真实运营商 IP = 泄露了我的小火箭配置文件加了这些规则： DOMAIN-KEYWORD,stun,REJECT DOMAIN-KEYWORD,turn,REJECT DOMAIN-SUFFIX, DOMAIN-SUFFIX, DOMAIN-SUFFIX, 加上这两个设置： ✖️ 禁用 STUN / 关闭 UDP 转发 ✖️ fallback-dns-server = system 改掉 ✖️ ipv6 = false 再测一次，全部是 Cloudflare 节点✅ （截图为证）节点稳不稳靠机场，我用这几家： 🔗 v2ny（老牌可靠）→ 🔗 基云科技（稳定之王）→ 🔗 航海（速度扛把子）→ 🔗 一元机场（网红机场·1元起）→ #翻墙# #小火箭# #WebRTC泄露# #隐私安全# #科学上网#

显示更多

0

2

1

0

转发到社区

Sumanth@Sumanth_077

2026.05.21 13:46

Open-source framework for building real-time voice AI agents! Pipecat is a Python framework for orchestrating audio, video, AI services, transports, and conversation pipelines. Voice-first architecture with pluggable components. What you can build: voice assistants, AI companions, multimodal interfaces, interactive storytelling, business agents (customer support, intake), and complex dialog systems. The framework handles speech recognition, text-to-speech, conversation logic, and real-time interaction. WebRTC and WebSocket transport built in. Ultra-low latency for natural conversations. Why Pipecat: • Voice-first: Integrates STT, TTS, and conversation handling in one framework • Pluggable: Supports multiple AI service providers for each capability • Composable pipelines: Build complex behavior from modular components • Real-time: Low-latency interaction with streaming audio/video Supported services: • Speech-to-Text: Deepgram, AssemblyAI, OpenAI Whisper, Groq, Azure, AWS, Google, and more • LLMs: OpenAI, Anthropic, Gemini, Groq, Mistral, Ollama, AWS, Azure, and more • Text-to-Speech: OpenAI, ElevenLabs, Deepgram, Cartesia, Azure, AWS, Google, and more • Speech-to-Speech: OpenAI Realtime, Gemini Multimodal Live, AWS Nova Sonic, Ultravox, Grok Voice Agent 10.3k+ stars on GitHub. I've shared link to the repo in the comments!

显示更多

0

3

7

6

转发到社区

meng shao@shao__meng

2026.05.21 04:19

在 Codex/Claude Code 等 Coding Agents 领域，文字是主要的输入输出方式；而在更广泛的通用 Agents 领域，特别是陪伴、实时交互等 Agents 方面，实时语音交互非常重要，语音的仿真生动程度、语音响应的及时性，这些都是 Voice Agent 在 LLM 基础之上要考虑的重点。 Voice Agent 的搭建过程，模型主要包括 ASR、VOD、TTS、LLM 等，而通信基础主要依靠 WebRTC 这个在直播和在线会议场景最通用的方案，前几天 OpenAI 也针对实时语音发布了 WebRTC 相关的技术方案。在 WebRTC 领域，有一个非常常用的方案团队：Agora，他们也推出了 Agora Skills，让 AI Agent 可以快速安装和理解、使用。今天咱们就看看基于 Codex 安装使用 Agora Skills 的完整过程。首先是 Agora Skills 安装，我只需要告诉 Codex：“安装 Agora Skills：分钟后 Codex 自动安装完成。安装完成它向 Codex 展示 Skills 的主要内容，包括了 Agora 的 RTC、RTM、Conversational AI、CLI 等多个产品的直接集成。因为 Agora Skills 的使用涉及到 Agora Token 认证，在 CLI 中也可以快速完成登录和环境变量设置保存，在网页端登录一次后，就不需要再离开 Codex 了。然后我让 Codex 帮我用 Agora Skills 写一个 Demo：用 Agora Skills 帮我搭一个浏览器里的 voice AI agent demo，从登录 Agora、创建项目到本地跑通，把关键log和性能数据展示出来。也是完全 Codex 自动读取 Skills 后完整，我没有介入，说明 Skills 中各种能力的编排和集成做的还是很到位，也是2-3 分钟后，Demo 就写完并运行起来了。这个 Demo 的功能主要是语音实时对话，从对话体感上看，很流畅，接近于人和人之间语音通话的响应延迟，语音包和 LLM 都可以切换，这里我只做了默认集成。看几个关键数据： · 整个 RTC、RTM、Conversational AI 启动过程在2-3秒内，很快 · 从我说话结束，到 Voice Agent 首个语音包输出（我听到声音），1秒左右如果你在做 Voice Agent 方面的探索，可以接入 Agora Skills 快速验证你的想法，让你的 Agent 能实时和你对话。抛砖几个场景，朋友们可以去尝试回来再交流：给 Agent 做一个会说话的陪伴形象、虚拟男女友、把声音和形象装进智能硬件。。

显示更多

0

1

11

2

转发到社区

Garry Tan@garrytan

2026.04.11 20:53

Just launched GBrain v0.8.0 If you have it installed, you can just ask your Claw/Hermes to upgrade to the latest GBrain and we'll automatically ask if you want to install your Voice WebRTC endpoint and Twilio number It's a true mega brain-trip to talk to your agent directly.

显示更多

0

58

758

60

转发到社区

与「WebRTC泄露」相关的搜索结果