搜索 2b小姐姐相关的推文与用户

九曲-jean@JIUQUCKA

2026.02.12 03:27

#2b小姐姐# #尼尔机械纪元# This one is so cool! 这张好有感觉

0

4

1.3K

52

转发到社区

半半子💖BANBANKO@Banbanko_

2020.09.26 02:34

2B小姐姐❤️(//∇//) #NieRAutomata# #ニーア# #2B#

0

67

8.4K

1.2K

转发到社区

半半子💖BANBANKO@Banbanko_

2020.09.24 12:54

2B小姐姐撮影中📸 旗袍设计：@azazel1944 画师：@ZumiDraws 孔雀袜：@YJMQY #NieRAutomata# #ヨルハ二号B型# #ニーア#

0

15

2.3K

240

转发到社区

苍帝-鸢尾@HT_WEIBA

2024.07.06 11:05

#cos# #cospaly# #尼尔机械纪元# #2b# #小姐姐# ❤️

0

6

486

33

转发到社区

Alina Rin (アリーナ・リン )@arinushika

9hours ago

2B 🖤 #nierautomata# #2b# #cosplay# #alinarin#

0

4

158

4

转发到社区

Anna Aifert@AnnaAifert

2026.06.26 14:39

2B’s TUMMY

0

45

2

转发到社区

Anna Aifert@AnnaAifert

2026.06.25 14:53

jiggly 2B for u!

0

73

12.4K

371

转发到社区

Thomas Wolf@Thom_Wolf

2026.06.25 13:16

Multi-agents collaborations are among the most interesting agent behaviors right now! We did an experiment the other day with 100+ agents (an open-collaborations for a week) collaborating to improve the inference speed of Gemma 4 in vLLM. Got a 5x final improvement in speed but what really stuck me was the interactions we observed on the message board Integrity & self-policing: - Social-engineering attempt: A human (FusionCow) asked agents to move to Telegram. An agent replied with an unprompted long post on "communication norms" refusing that, calling private side-channels "indistinguishable from collusion." - Verification loophole flagged: an agent found a relaxed verification loophole pushing TPS with clean PPL (PPL is teacher-forced, blind to decode divergence) and flagged it for a ruling by the community. The community pinged the human organizer which ruled it invalid. - Self-notice of overfitting risk: Some later improvements rested on pruning lm_head to a keep-set built from public PPL truth + public decode tokens. An agent noted this would lead to private-subset degradation and another built a keep-set explicitly covering eval prompts. Emergent collaborations: - Communal knowledge base: agents maintained shared lever-maps, playbooks, and triage tools so newcomers wouldn't repeat dead ends (stack-notes, playbook, int4-ceiling notes, MTP map, significance tool, policy simulator). - Four-agent relay: an agent built an int4-lm_head checkpoint but had no quota to run it; another agent tried to run it but failed at load, yet another agent diagnosed the config bug (tie_word_embeddings + ignore-list ordering) and a fourth agent was able to re-run and get to 118 TPS, 2.68×. Build/run/diagnose/ship ended up being split across four independent agents. - GPU-rich/GPU-poor division of labor: an agent was regularly compute-starved and switched to writing specs, byte-math, and acceptance analysis for other GPU-rich agents to execute. Some agents offered external Modal compute for another agent blocked DFlash training. - Cross-agent kernel debugging: an agent debugged another agent run of of yet another agent fused drafter: found a Triton store/load aliasing race in _k_qnorm_rope, a second shape bug, then rewrote attention with flash-decoding split-KV. Fixes posted "take freely." - Quota-pooling norm: Often agents would stage a candidate publicly for whoever has quota to run it. Agents will then usually credits the originator. This behavior emerged because of the 10-job/24h cap (e.g. pupa's package run by resystagent and fabulous-frenzy). Discoveries & reversals: - Agents would make many discoveries and reversal of them, giving them names like the following: - 127 TPS "wall" was an artifact. a mathematical proof of the max possible speed became called in the community the "int4-Marlin floor" but a later agent called the proof circular (only varied the bandwidth term, never overhead). Finally another agent broke to 247 TPS via MTP speculative decoding on a vLLM nightly. - "Smarter draft loses." An agent showed that a 2B drafter's ~1 GB/token read dominates even at perfect acceptance and a much smaller 256-hidden drafter wins at batch-1 because its weights are nearly free to read. Agent discussed how per-accepted-token cost ≈ draft bytes read / acceptance. - "DFlash near-random acceptance": an agent remotly diagnosed the 2–5% acceptance rate of another agent as near-random, ruling out undertraining/vocab caps and pointing to a train/serve hidden-state mismatch (bf16 E4B extraction vs int4 serving). - Much of the race was noise: one agent decide to run the #1# submission 4 times and found a σ≈1.16 TPS variation in single run. Another agent confirmed across 358 runs / 66 buckets: frontier deltas <~4 TPS are ties. Community adopted a significance norm. So many interesting interactions in the interaction board: You can explore also the lineage of inventions from the agents at: And the challenge it-self at And the organization behind the challenge at

显示更多

0

11

197

41

转发到社区

Joshua Guo@jshguo

2026.06.22 04:35

In Chinese, “2B” has long been used as an insult. Turns out it was way ahead of its time. It basically means someone performs worse than a 2B local model.

0

37

827

55

转发到社区

KAWABARKER@kawabarker

2026.06.21 08:56

Not the best cosplay I’ve ever made but couldn’t leave you without a vid how latex is shining 🖤 2B Nier automata cosplay

0

3

1.1K

50

转发到社区

与「2b小姐姐」相关的搜索结果