François Chollet (@fchollet) “JAX is fast ⏩ Benchmarking Mistral 7B inference on a V100 (float16, batch_size”

François Chollet

@fchollet

Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.

Joined August 2009

826 Following 689.3K Followers

François Chollet @fchollet

2024.02.29 22:31

JAX is fast ⏩ Benchmarking Mistral 7B inference on a V100 (float16, batch_size 10): the throughput of the KerasNLP implementation with JAX is over 2x higher than the Hugging Face PyTorch one (compiled). Worth noting that this is "out of the box" performance: the KerasNLP model is not optimized for performance. It's written the way anyone would naively write a Keras 3 LLM.
