Top
New
🔦
limoce
joined
4/3/2021, 1:02 PM
has
861
karma
Posts
FP8 is ~100 tflops faster when the kernel name has "cutlass" in it
by
limoce
on 7/11/2025, 10:36 AM
with
107
comments
Polaris: A Post-training recipe for scaling RL on Advanced Reasoning models
by
limoce
on 7/9/2025, 6:58 AM
with
1
comments
Overclocking LLM Reasoning: Monitoring and Controlling LLM Thinking Path Lengths
by
limoce
on 7/6/2025, 12:53 PM
with
0
comments
Neutrino: Probing-Based eBPF-Like GPU Kernel Profiling
by
limoce
on 7/1/2025, 10:33 AM
with
0
comments
Machine Learning Conferences Should Establish "Refutations and Critiques" Track
by
limoce
on 6/26/2025, 10:27 AM
with
0
comments
SuperGPQA: Scaling LLM Evaluation Across 285 Graduate Disciplines
by
limoce
on 3/4/2025, 7:26 AM
with
0
comments
SepLLM: Accelerate LLMs by Compressing One Segment into One Separator
by
limoce
on 3/3/2025, 1:27 PM
with
2
comments
Step-Video-T2V: The Practice, Challenges, and Future of Video Foundation Model
by
limoce
on 2/17/2025, 9:54 AM
with
5
comments
Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset
by
limoce
on 2/5/2025, 4:02 AM
with
0
comments
Libnginx: Nginx as a Shared Library
by
limoce
on 2/4/2025, 7:56 AM
with
0
comments
DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf]
by
limoce
on 12/13/2024, 12:53 PM
with
0
comments
Fast vectorizable algorithms of binary searching for floating point numbers
by
limoce
on 11/15/2024, 12:53 AM
with
0
comments
New OpenAI Feature: Predicted Outputs
by
limoce
on 11/5/2024, 2:47 AM
with
7
comments
Collaborative Filtering Is Wrong and Here Is Why
by
limoce
on 10/24/2024, 9:03 AM
with
0
comments
REST: A Plug-and-Play Method for Accelerating LLM Without Additional Training
by
limoce
on 10/20/2024, 6:13 AM
with
0
comments
Smoke 'em if you got 'em: Hacker gains root access using cigarette lighter
by
limoce
on 10/12/2024, 1:20 PM
with
0
comments
O1 Replication Journey: A Strategic Progress Report
by
limoce
on 10/9/2024, 8:09 AM
with
0
comments
Failures of Gradient-Based Deep Learning (2017) [pdf]
by
limoce
on 8/15/2024, 10:19 AM
with
0
comments
Qwen2-VL
by
limoce
on 8/14/2024, 8:20 AM
with
0
comments
Qwen2-Math
by
limoce
on 8/8/2024, 3:00 PM
with
38
comments
FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention
by
limoce
on 8/8/2024, 7:24 AM
with
24
comments
MiniCPM-v2.6: GPT-4V Level MLLM for Single/Multi Image and Video on Your Phone
by
limoce
on 8/7/2024, 2:00 AM
with
0
comments
MindSearch: LLM-Based Web Search Engine Similar to Perplexity.ai and SearchGPT
by
limoce
on 8/1/2024, 8:53 AM
with
0
comments
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters
by
limoce
on 6/12/2024, 11:01 AM
with
0
comments
PowerInfer-2: Fast Large Language Model Inference on a Smartphone
by
limoce
on 6/11/2024, 2:19 PM
with
0
comments
Large-scale photonic chiplet Taichi empowers 160TOPS/W AI
by
limoce
on 4/12/2024, 7:49 AM
with
0
comments
Asterinas: OS kernel written in Rust and providing Linux-compatible ABI
by
limoce
on 3/5/2024, 8:52 AM
with
0
comments
Mq-deadline scalability improvements (with more than 100% improvement)
by
limoce
on 1/20/2024, 12:04 PM
with
0
comments