Thank you so much, Fort Worth!
It was an awesome night🤙
It was the only concert in Fort Worth on this tour
But don't be sad, Texas!🤠
We still have one more in Houston tonight!
So please stay tuned🤟
#ENHYPEN #JAY #MANIFESTO_IN_FORTWORTH
i don’t wanna be moody on the tl but making so much content for ig every day really helped me with creativity and structure. it’s been a month now since being banned. i’m just a bit lost and sad idk
🚨Important Announcement: Puberty Blockers Judicial Review🚨
Following months of radio silence, I’m saddened to report that the government has announced that it is pushing forward with the puberty blockers trial, regardless of the significant ethical concerns raised that led to the temporary pause.
Most troubling of all is that they are now refusing to halt recruitment of children until the end of the Judicial Review that we are bringing.
As such, we have no choice but to seek an emergency injunction to block a single child being recruited and given this poison. There will be a hearing at the end of July to determine this.
Please rest assured that I and the entire team will be pursuing this Judicial Review all the way.
@jamesmurray_ldn - Just as with your predecessor, Wes Streeting, we implore you to do the right thing and pull the plug on this monstrosity of a trial. If you don’t, we will see you in Court.
this is my personal singularity moment
this post may sound like a paid ad. I only wish. I'm concerned, more so than happy. the world is changing, and, among the scenarios where AI goes terribly wrong, inequality is the most realistic, yet, the one Anthropic seems to be the least concerned about. I'm glad OpenAI is taking the opposite stance: *personal AGI for everyone*. I think this is a commendable position in the times we live. but who am I in the queue of the bread?
anyway, Fable is here, so I'll just report my first-hour experience
first of all, all my pet prompts are solved.
→ λ-calculus puzzles
→ bug questions
→ one-shot apps
all are trivial to it.
I don't have anything harder other than my ongoing work
so, in the last several days, I've been toying with HVM5, a new interaction net evaluator with a faster loop.
after writing the first version, I left 32 GPT-5 agents working for ~20 hours each. this resulted in up to 2x speedups, but the file size increased by 2-fold and quality decreased significantly.
I then simplified the whole thing into an even simpler core, and left Opus 4.8 and GPT 5.5 optimizing it for 8 hours. Opus got a legit 6% - 34% speedup in most benches. GPT got better results, but, sadly, an unusable file.
I then asked Fable to optimize it.
2 hours later, it landed a 1770% speedup in one case, 100%+ in 4 others, and 22% on average. yes, in 2 hours it outperformed me, opus 4.8 and a swarm of gpt 5.5 agents, by one order of magnitude.
that could not possibly be legit. "it must be hardcoding the benchmarks" (GPT trauma). so I read its explanation and what it did was, indeed, the most high impact optimization one could try first. seems like HVM5 was wasting a lot of time garbage-collecting unused branches of pattern-match nodes. I had optimized that for static mats, but not for dynamic mats. skill issue. Fable figured out how to do it for these, resulting in a massive speedup in some benches
but wait, is that *correct*? I'm not sure yet, it is credible, but this is the kind of thing that is very easy to get wrong on interaction nets. the problem is, when I was ready to start auditing Fable's solution so I could tell whether it was buggy or legit, it interrupted me to tell me it had found a massive bug in the code *I* had written.
... wait, what?
so... for garbage collection purposes, I stored a bit on lambda term pointers that meant "the variable bound by this lambda has been freed, so, its lambda must free whatever argument it is applied to". that's fine. yet, on duplicator nodes, I also used the same bit to mean "one of the duplicated variables was freed, so, treat this dup as a passthrough no-op". so, if a lambda entered a duplicator, it would mistake the lambda's collection bit for its own, resulting in corrupted interaction!
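The collision described above can be sketched with a toy tagged-pointer model. This is only an illustration of one flag bit carrying two meanings, not HVM5's actual term layout: the tag values, the bit position, and every function name below are hypothetical.

```python
# Toy tagged pointer: low 4 bits = node tag, bit 4 = one shared flag bit.
# Every name and bit position here is hypothetical, not HVM5's real layout.
TAG_LAM = 0x1
TAG_DUP = 0x2
FLAG_BIT = 1 << 4  # on a LAM: "bound variable was freed"
                   # on a DUP: "one copy was freed, act as a passthrough"


def make_ptr(tag: int, addr: int, flag: bool = False) -> int:
    """Pack an address, the optional flag, and a tag into one integer."""
    return (addr << 5) | (FLAG_BIT if flag else 0) | tag


def tag_of(ptr: int) -> int:
    """Extract the node tag from the low 4 bits."""
    return ptr & 0xF


def has_flag(ptr: int) -> bool:
    """Read the shared flag bit, whatever it happens to mean for this tag."""
    return bool(ptr & FLAG_BIT)


def dup_is_passthrough_buggy(dup_ptr: int, entering_ptr: int) -> bool:
    """Buggy check: trusts the flag on whatever term enters the dup."""
    return has_flag(entering_ptr)


def dup_is_passthrough_fixed(dup_ptr: int, entering_ptr: int) -> bool:
    """Fixed check: only the dup's own pointer can carry the dup's flag."""
    return tag_of(dup_ptr) == TAG_DUP and has_flag(dup_ptr)


lam = make_ptr(TAG_LAM, addr=7, flag=True)   # lambda whose variable was freed
dup = make_ptr(TAG_DUP, addr=9, flag=False)  # an ordinary, live duplicator
```

In this toy model the buggy check treats the live duplicator as a no-op as soon as the flagged lambda enters it, while the fixed check keeps the two meanings apart by looking at the tag first.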
that's a mouthful, so why am I writing this?
just so you can appreciate the sheer absurdity of what just happened. I didn't ask it to find bugs. I asked it for an optimization. and even if I did ask it to find bugs, this bug is so astonishingly subtle and specific, identifying it takes mastering the domain to an extent that is beyond even me. I'd easily need hours or days to fix it, *if* I ever came across it. chances are it would just go unnoticed. and Fable found it and fixed it like it was nothing, while it was busy adding a 17x speedup to a file that I, Opus 4.8, and a fleet of GPT 5.5 could barely make 2x faster.
oh and there is also another tab where it is also ripping through Bend's codebase and finishing everything I had to do
I don't know what to say anymore
this isn't about Anthropic or OpenAI, this is about our collective future as a species. the world is changing, and we need to be aware of it, and discuss how to handle this change.
receipt below . . .
Crypto has given us everything we wanted
Only because we gave something to it first
Today the new people think they’re owed something because they’re here
When we joined the industry we didn’t feel that level of entitlement and still don’t
Sad state of affairs with the take only attitude
Funny enough the biggest takers are the least credible and have done nothing important or memorable
At some point this has to change and the swamp gets drained
Likely to be from cannibalism, like we are seeing now as the leeches leech from each other
There are a lot of great people here that I have gotten to know and become part of their journey
Many more of them have also left because of the late stage capitalism we are seeing here
It's admirable to still see people striving for impact and fighting for progress multiple cycles later
We have also reached a point of adoption where everyone knows what crypto is and has heard everything about it
But has not had a great experience onchain
It’s now a game of delivery and overall experience
They are aware of the traditional problems and are seeking active solutions
Henry Nowak's death is more horrific than you think:
>Henry was stabbed at ~11:30 pm but was not pronounced dead until 67 minutes later (12:37 am). It gets worse...
>During this time, Digwa's brother arrived (shortly after the attack). It was his brother who phoned 999, and not to call an ambulance: he phoned the police alleging that Henry had attacked them.
>It's been reported that there was some deliberation/delay before phoning 999. The call alleged that Henry drunkenly attacked them (Henry was sober; blood alcohol below the drink-drive limit). Digwa's brother wanted to punish Henry.
>Exact quotes from the call (read out in court at Southampton Crown Court):
“We’ve just got attacked racially by some white person. He’s physically attacked my brother, we’re Sikhs, we wear a turban and he’s just attacked my brother. We’re restraining him right now because he’s just attacked my brother and took my brother’s turban off. He also said, he’s verbally attacked my brother racially. I’m not having this as a regular occurrence, I live here, I’m not having this a regular occurrence. He ain’t fighting people, he’s racially attacking people, that’s what he’s doing. Nah, he sees some brown people, that’s what it was.”
>They were restraining Henry until the police arrived (Digwa stole Henry's phone so he couldn't get help).
>When the police arrived, Digwa's father was holding Henry against a wall (his father said: "He keeps dropping down, so I am just trying to keep him up"). There was also a visible blood trail, but it is unknown when officers first noticed it (sources differ: one said it was when the police entered the scene, another said it was after Henry passed out).
>Digwa's mother removed the murder weapon from the scene.
>Police bodycam footage was played in court (audio only, no video; another source said a transcript was read):
Henry says “I am dying”;
Digwa replies “You’re not dying bro.”
{Approximately 10 minutes later}: Henry says “You stabbed me”;
Digwa denies it and accuses Nowak of recording him.
Henry's final recorded words: “Please brother, I can’t breathe.”
{He passed out a few minutes later}
>Before the attack, Henry was recording a video of Digwa on his phone, it is a weird exchange: Henry singing/yawning, then addressing Digwa: “Innit bad man, what bad man. You’re a bad man, say you’re a bad man, go on.” Digwa replied: “I am a bad man.” The footage ended shortly before the stabbing.
Some critical details that haven't been released:
>The time the 999 call was made.
>The full 999 transcript.
>The time the police arrived.
>The time an ambulance was called (an air ambulance flew in a doctor).
I question the order of the stabbing:
>There's a lack of defensive wounds on Henry's arms and hands (Henry was sober).
>I believe Henry was stabbed in the groin and the back of the legs while he was trying to scale a fence to get away (you can't easily get to a man's groin area, there's a reason they're nicknamed the 'crown jewels'). Also, stab wounds to that area can be catastrophic: the aorta and arteries to the legs (the largest in the body) flow through there, not to mention the nerve endings. And the way they pinned Henry against a wall meant he would be losing blood faster.
>Given what I have read so far, I don't understand why there haven't been charges against the brother & father. They were aware that Henry had been stabbed, but they continued to forcefully detain him (the very definition of false imprisonment). I'd argue it was sadistic torture. You can make the excuse of a single Sikh having mental health issues (they will), but that doesn't excuse the actions of Digwa's brother, mother & father.
Some of the research & sources:
I recently spent 2 weeks in China.
6 cities: Shanghai, Beijing, Xi’an, Zhangjiajie, Chongqing and Chengdu.
I went there with curiosity.
Like many Indians, I had heard a lot about China through media, social media and conversations. I expected to see progress, maybe discover some business ideas, and understand what the country is actually building.
I came back with a very uncomfortable feeling.
Not because I found a business idea for myself.
But because I saw 100 things that governments can do when infrastructure, tourism, transport, urban planning and civic systems are treated seriously.
I travelled within China by flights, trains, cars and local transport. The infrastructure was honestly stunning.
Clean cities. Smooth roads. High-speed trains. Well-managed traffic. Public spaces that actually feel designed for people. Tourist destinations that are built, maintained and promoted like national assets.
And then I kept thinking about India.
We keep comparing ourselves to China. Our media keeps telling us how India is catching up, how China is restrictive, how we are better in so many ways.
After spending time there and speaking to people, I realised how much of that narrative is just comfort food.
China is not perfect. No country is.
But on infrastructure, execution, tourism, civic discipline and quality of urban life, they are not 5 years ahead of us.
They are decades ahead.
The saddest part for me was the currency.
Everything felt expensive. Not because China was insanely expensive, but because the rupee has weakened so much that even normal spending starts feeling heavy. As an Indian taxpayer, that genuinely hurt.
We pay taxes. We work hard. We talk about becoming a global power.
But where is the quality of life?
Where is the civic sense?
Where is the infrastructure that makes daily life easier?
Where is the tourism vision beyond religious tourism?
I met travellers from other countries who were excited to visit China because they wanted to see its progress. When I asked about India, many had no real desire to visit. Not out of hate. India simply was not on their aspirational travel list.
That should bother us.
Even the so-called “closed internet” surprised me. We are told people there are missing out because they don’t use Google, Instagram, WhatsApp or Facebook.
But China has built its own digital ecosystem. Payments, maps, transport, messaging, shopping, everything works inside their own infrastructure. People did not seem to feel deprived. They seemed adapted.
Again, this is not a hate post.
I love India. That is exactly why this trip bothered me.
Patriotism cannot only be about saying we are great.
Real patriotism is having the courage to admit where we are falling behind.
China made me realise one thing very clearly:
India’s potential is not the problem.
Execution is.
And unless we stop comforting ourselves with comparisons and start demanding better infrastructure, better governance, better tourism, cleaner cities and a higher quality of life, we will keep celebrating the idea of progress instead of actually living it.
I was so touched reading comments under the video clip of Xiao Zhan eating ice cream on Douyin that I decided to translate some of them. Just like the way #XiaoZhan sends “postcards” to XFXs on his every trip, XFXs also confide in him like a brother or a family member.
(Part 1. I put a few more comments under part 2)
💌 Zhan Zhan, I want to share with you a bit about my life. My recent situation is not very good. Maybe that's why when I listen to the background music, tears suddenly flow. Family pressure, and a small child... I don't know if my choice a year ago was right? I used to have work and colleagues. But now I've given up my job and become a full-time mother. Maybe I am not a qualified mother to say this. When I was a child, I was carefree and wanted to grow up quickly. Now I long to live those carefree days. Okay, no more complaining. Let's move forward together, you also. 🌹
💌 Zhan gege, retaking the exam is really tiring 😭. I will definitely pass in 2025, right gege?
💌 Zhan Zhan, I have been trying to get pregnant for many years without success. Next month I will try IVF. I am a bit scared, but I also hope to have a baby smoothly.
💌 Gege, should I take the postgraduate entrance exam? I can't make up my mind.
💌 I live alone in a foreign country. Watching Xiao Zhan's vlog and reading XFXs’ sincere messages moved me to tears ❤️
💌 Zhan gege, today is my birthday. Today I am 21 years old 😆. I will dedicate my birthday wish to you. Guess what I wished for, I will tell you quietly. I wish that my Zhan Zhan will have a smooth and healthy life.
💌 Gege, I just submitted my graduation thesis. I'm looking for a job now, and feeling so much pressure. I don't know where my next journey in life is going. My college years are almost over, and I feel sad. But when I see you abroad, it really feels healing. It's like I’m breathing a moment of free time with you, and seeing my carefree self in a parallel world 😭
💌 Good evening, Zhan Zhan. I just got back from Beijing three days ago. I'm a little anxious and confused about what kind of job to look for next.
💌 Gege, let me show you today’s sunset at my place.
💌 Zhan gege, I've been so busy lately. Every day I'm overwhelmed with work. I feel like I'm about to collapse. Watching the video and listening to this bgm, in this moment, I suddenly miss you so much. 😭
💌 Zhan gege, I am about to get married, but I am still so confused. I don’t know if I can be a good wife, or a qualified mother in the future... I always have an inexplicable fear of the unknown future...
💌 Zhan Zhan, actually I have been very tired lately. Last week I accompanied my dad to Guangzhou for surgery. Last week my mom was hospitalized and I also had to accompany my dad to do dialysis. Today I had to take the blame for someone at work. Tonight after helping my boy with his homework, I watched your video as soon as he went to bed. The vlog felt so warm and beautiful. The feeling of helplessness recently made me want to cry but couldn't. I've long considered you a relative, a very close friend in my heart. So when I saw you, I naturally shed tears unconsciously. Let me confide my feelings with you in this comment section. Thank you for your presence and comfort. I will try to cheer up and continue to work hard to live my life well. I also wish you all the best, health, safety, happiness and worry-free.
💌 I want to eat that ice cream. Marriage is bitter. I don't want to try it in my next life.
💌 Zhan Zhan, I will retire in 5 years. I hope you can hold a concert so I can go see it. I've known you since 2019. I really want to see you at least once.
💌 Rewatching this clip made me want to cry, even though I was so happy watching the vlog earlier at noon. Seeing you eating like a kitten, you must be happy. You were so self-disciplined that you only ate one glutinous rice ball. But you bought your favorite ice cream and ate it slowly.
I was given early access to Grok 3 earlier today, making me I think one of the first few who could run a quick vibe check.
Thinking
✅ First, Grok 3 clearly has an around state of the art thinking model ("Think" button) and did great out of the box on my Settlers of Catan question:
"Create a board game webpage showing a hex grid, just like in the game Settlers of Catan. Each hex grid is numbered from 1..N, where N is the total number of hex tiles. Make it generic, so one can change the number of "rings" using a slider. For example in Catan the radius is 3 hexes. Single html page please."
Few models get this right reliably. The top OpenAI thinking models (e.g. o1-pro, at $200/month) get it too, but all of DeepSeek-R1, Gemini 2.0 Flash Thinking, and Claude do not.
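The geometric core of that prompt can be sketched as follows. This covers only the tile layout and numbering, not the HTML page or the slider. It assumes axial (q, r) hex coordinates and that the "radius" counts the center tile as ring 1, so radius 3 gives Catan's 19 tiles; the function names are made up for illustration.

```python
def hex_cells(radius: int) -> list[tuple[int, int]]:
    """Axial (q, r) coordinates of a hexagonal board; radius=1 is a single hex."""
    k = radius - 1  # number of rings around the center tile
    cells = []
    for q in range(-k, k + 1):
        # Keep the implied third cube coordinate s = -q - r within [-k, k] too.
        for r in range(max(-k, -q - k), min(k, -q + k) + 1):
            cells.append((q, r))
    return cells


def numbered_board(radius: int) -> dict[int, tuple[int, int]]:
    """Number the tiles 1..N in generation order."""
    return {i: cell for i, cell in enumerate(hex_cells(radius), start=1)}


def tile_count(radius: int) -> int:
    """Closed form for N: one center tile plus 6 * ring_index tiles per ring."""
    return 1 + 3 * radius * (radius - 1)


board = numbered_board(3)  # Catan-sized board
```

A slider would simply call `numbered_board` again with the new radius and redraw; the closed form in `tile_count` is a quick check that no tile was dropped or duplicated.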
❌ It did not solve my "Emoji mystery" question where I give a smiling face with an attached message hidden inside Unicode variation selectors, even when I give a strong hint on how to decode it in the form of Rust code. The most progress I've seen is from DeepSeek-R1 which once partially decoded the message.
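For readers unfamiliar with the trick, here is a minimal sketch of hiding bytes in Unicode variation selectors. The post does not spell out the exact scheme (and the original hint was in Rust), so the byte mapping below is an assumption: bytes 0 to 15 map to U+FE00..U+FE0F and bytes 16 to 255 map to U+E0100..U+E01EF. The function names are hypothetical.

```python
from typing import Optional


def byte_to_vs(b: int) -> str:
    """Map one byte to a variation selector code point (assumed scheme)."""
    return chr(0xFE00 + b) if b < 16 else chr(0xE0100 + (b - 16))


def vs_to_byte(ch: str) -> Optional[int]:
    """Inverse mapping; returns None for characters that are not selectors."""
    cp = ord(ch)
    if 0xFE00 <= cp <= 0xFE0F:
        return cp - 0xFE00
    if 0xE0100 <= cp <= 0xE01EF:
        return cp - 0xE0100 + 16
    return None


def encode(base: str, message: str) -> str:
    """Append the message's UTF-8 bytes to `base` as invisible selectors."""
    return base + "".join(byte_to_vs(b) for b in message.encode("utf-8"))


def decode(text: str) -> str:
    """Collect every variation selector in `text` and decode it as UTF-8."""
    data = bytes(b for b in map(vs_to_byte, text) if b is not None)
    return data.decode("utf-8")


hidden = encode("😀", "hello")  # renders as a single smiling face
```

One caveat of this sketch: if the base emoji already carries its own variation selector (for example U+FE0F), `decode` would read it as a payload byte.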
❓ It solved a few tic tac toe boards I gave it with a pretty nice/clean chain of thought (many SOTA models often fail these!). So I upped the difficulty and asked it to generate 3 "tricky" tic tac toe boards, which it failed on (generating nonsense boards / text), but then so did o1 pro.
✅ I uploaded the GPT-2 paper. I asked a bunch of simple lookup questions, all worked great. Then asked to estimate the number of training flops it took to train GPT-2, with no searching. This is tricky because the number of tokens is not spelled out so it has to be partially estimated and partially calculated, stressing all of lookup, knowledge, and math. One example is 40GB of text ~= 40B characters ~= 40B bytes (assume ASCII) ~= 10B tokens (assume ~4 bytes/tok), at ~10 epochs ~= 100B token training run, at 1.5B params and with 2+4=6 flops/param/token, this is 100e9 X 1.5e9 X 6 ~= 1e21 FLOPs. Both Grok 3 and 4o fail this task, but Grok 3 with Thinking solves it great, while o1 pro (GPT thinking model) fails.
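The back-of-the-envelope estimate above can be written out step by step. Every default below is one of the post's stated assumptions (40GB of ASCII text, ~4 bytes per token, ~10 epochs, 1.5B parameters, 6 FLOPs per parameter per token); only the function and parameter names are invented here.

```python
def estimate_training_flops(
    dataset_bytes: float = 40e9,        # 40GB of text, assumed ASCII
    bytes_per_token: float = 4.0,       # rough tokenizer compression
    epochs: float = 10.0,               # assumed number of passes
    params: float = 1.5e9,              # GPT-2 parameter count
    flops_per_param_token: float = 6.0, # 2 forward + 4 backward
) -> float:
    """Training FLOPs ~= tokens seen * parameters * FLOPs per param per token."""
    tokens_per_epoch = dataset_bytes / bytes_per_token  # ~10B tokens
    training_tokens = tokens_per_epoch * epochs         # ~100B tokens
    return training_tokens * params * flops_per_param_token


total_flops = estimate_training_flops()
print(f"{total_flops:.1e} FLOPs")  # about 9e20, i.e. on the order of 1e21
```

The exact product is 9e20, which the post rounds to ~1e21; the point of the exercise is the order of magnitude, since each input is itself a rough guess.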
I like that the model *will* attempt to solve the Riemann hypothesis when asked to, similar to DeepSeek-R1 but unlike many other models that give up instantly (o1-pro, Claude, Gemini 2.0 Flash Thinking) and simply say that it is a great unsolved problem. I had to stop it eventually because I felt a bit bad for it, but it showed courage and who knows, maybe one day...
The impression overall I got here is that this is somewhere around o1-pro capability, and ahead of DeepSeek-R1, though of course we need actual, real evaluations to look at.
DeepSearch
Very neat offering that seems to combine something along the lines of what OpenAI / Perplexity call "Deep Research", together with thinking. Except instead of "Deep Research" it is "Deep Search" (sigh). Can produce high quality responses to various researchy / lookupy questions you could imagine have answers in articles on the internet, e.g. a few I tried, which I stole from my recent search history on Perplexity, along with how it went:
- ✅ "What's up with the upcoming Apple Launch? Any rumors?"
- ✅ "Why is Palantir stock surging recently?"
- ✅ "White Lotus 3 where was it filmed and is it the same team as Seasons 1 and 2?"
- ✅ "What toothpaste does Bryan Johnson use?"
- ❌ "Singles Inferno Season 4 cast where are they now?"
- ❌ "What speech to text program has Simon Willison mentioned he's using?"
❌ I did find some sharp edges here. E.g. the model doesn't seem to like to reference X as a source by default, though you can explicitly ask it to. A few times I caught it hallucinating URLs that don't exist. A few times it said factual things that I think are incorrect and it didn't provide a citation for it (it probably doesn't exist). E.g. it told me that "Kim Jeong-su is still dating Kim Min-seol" of Singles Inferno Season 4, which surely is totally off, right? And when I asked it to create a report on the major LLM labs and their amount of total funding and estimate of employee count, it listed 12 major labs but not itself (xAI).
The impression I get of DeepSearch is that it's approximately around the level of Perplexity's DeepResearch offering (which is great!), but not yet at the level of OpenAI's recently released "Deep Research", which still feels more thorough and reliable (though still nowhere near perfect, e.g. it, too, quite incorrectly excluded xAI as a "major LLM lab" when I tried with it...).
Random LLM "gotcha"s
I tried a few more fun / random LLM gotcha queries I like to try now and then. Gotchas are queries that are specifically on the easy side for humans but on the hard side for LLMs, so I was curious which of them Grok 3 makes progress on.
✅ Grok 3 knows there are 3 "r" in "strawberry", but then it also told me there are only 3 "L" in LOLLAPALOOZA. Turning on Thinking solves this.
✅ Grok 3 told me 9.11 > 9.9. (common with other LLMs too), but again, turning on Thinking solves it.
✅ Few simple puzzles worked ok even without thinking, e.g. *"Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?"*. E.g. GPT4o says 2 (incorrectly).
❌ Sadly the model's sense of humor does not appear to be obviously improved. This is a common LLM issue with humor capability and general mode collapse, famously, e.g. 90% of 1,008 outputs asking ChatGPT for a joke were repetitions of the same 25 jokes. Even when prompted in more detail away from simple pun territory (e.g. give me a standup), I'm not sure that it is state of the art humor. Example generated joke: "*Why did the chicken join a band? Because it had the drumsticks and wanted to be a cluck-star!*". In quick testing, thinking did not help, possibly it made it a bit worse.
❌ Model still appears to be just a bit too overly sensitive to "complex ethical issues", e.g. generated a 1 page essay basically refusing to answer whether it might be ethically justifiable to misgender someone if it meant saving 1 million people from dying.
❌ Simon Willison's "*Generate an SVG of a pelican riding a bicycle*". It stresses the LLM's ability to lay out many elements on a 2D grid, which is very difficult because the LLMs can't "see" like people do, so it's arranging things in the dark, in text. Marking as fail because these pelicans are quite good, but still a bit broken (see image and comparisons). Claude's are best, but imo I suspect they specifically targeted SVG capability during training.
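Three of the gotchas above (letter counting, 9.11 vs 9.9, and the Sally puzzle) have answers that are easy to check mechanically, which is part of why they make good probes. A small sketch, with helper names made up for illustration:

```python
from decimal import Decimal


def count_letter(word: str, letter: str) -> int:
    """Case-insensitive count of a single letter in a word."""
    return word.lower().count(letter.lower())


def sisters_of_sally(sisters_per_brother: int) -> int:
    """Brothers share the same sisters, and Sally is one of them."""
    return sisters_per_brother - 1


r_count = count_letter("strawberry", "r")    # 3
l_count = count_letter("LOLLAPALOOZA", "L")  # 4, not 3
nine_eleven_bigger = Decimal("9.11") > Decimal("9.9")  # False
sally = sisters_of_sally(2)                  # 1, not 2
```

The decimal comparison uses `Decimal` so the check reads as plain decimal arithmetic rather than depending on binary floating point.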
Summary. As far as a quick vibe check over ~2 hours this morning, Grok 3 + Thinking feels somewhere around the state of the art territory of OpenAI's strongest models (o1-pro, $200/month), and slightly better than DeepSeek-R1 and Gemini 2.0 Flash Thinking. Which is quite incredible considering that the team started from scratch ~1 year ago, this timescale to state of the art territory is unprecedented. Do also keep in mind the caveats - the models are stochastic and may give slightly different answers each time, and it is very early, so we'll have to wait for a lot more evaluations over a period of the next few days/weeks. The early LM arena results look quite encouraging indeed. For now, big congrats to the xAI team, they clearly have huge velocity and momentum and I am excited to add Grok 3 to my "LLM council" and hear what it thinks going forward.