Tag

#inference

1 item tagged #inference

tools · 1s read · just now

New 1-bit quantization runs 70B models on a laptop.

Why it matters: Local inference just got dramatically cheaper for indie builders.