Sparsity in LLM
#Day4 of Being an Imposter 😛 Sparsity in #LLMs refers to the fraction of parameters that are active during an […]
#Day4 of Being an Imposter 😛 Sparsity in #LLMs refers to the fraction of parameters that are active during an […]
Day 1 of being an imposter 😛 RoPE (Rotary Positional Embedding) is crazy good way to reduce the dimensional space,