Tokasaurus: An LLM inference engine for high-throughput workloads (scalingintelligence.stanford.edu)
137 points by rsehrlich 8 hours ago | 18 comments
1137 points by rsehrlich 8 hours ago | 18 comments
1432 points by bdr 16 hours ago | 151 comments
279 points by mikebannister 7 hours ago | 36 comments
3120 points by BUFU 5 hours ago | 122 comments
449 points by wey-gu 5 hours ago | 13 comments
569 points by noleary 7 hours ago | 15 comments
658 points by echollama 7 hours ago | 32 comments
721 points by nativeit 4 hours ago | 6 comments
832 points by DavideNL 40 minutes ago | 3 comments
980 points by ofalkaed 8 hours ago | 30 comments
1021 points by benbreen 9 hours ago | 2 comments
11371 points by 256dpi a day ago | 158 comments
12165 points by zdw 14 hours ago | 81 comments
1374 points by rmason 10 hours ago | 6 comments
1438 points by us-merul 7 hours ago | 9 comments
1579 points by bundie 13 hours ago | 78 comments
1623 points by jorkingit 6 hours ago | 10 comments
17626 points by doener a day ago | 343 comments
1849 points by softwaredoug 11 hours ago | 2 comments
19138 points by mrcgnc 6 hours ago | 103 comments
2078 points by mitchbob 3 days ago | 43 comments
219 hours ago
2284 points by anteloper 12 hours ago | 43 comments
2345 points by 90s_dev 12 hours ago | 14 comments
2416 points by mful 3 hours ago | 7 comments
25199 points by mikeshi42 12 hours ago | 46 comments
26203 points by robertvc 11 hours ago | 102 comments
2728 points by rbanffy 9 hours ago | 21 comments
28325 points by picture a day ago | 264 comments
2953 points by aluzzardi 12 hours ago | 13 comments
30