resource: 1-bit-LLMs.pdf BitNet b1.58 every single parameter (or weight) of the LLM is ternary {-1, 0, 1}