🥝 Daniël de Kok

Recent Notes

  • ESPHome Senseair S88

    Jan 07, 2026

  • Welcome to Daniël's website

    Jan 05, 2026

  • WGMMA

    Jan 05, 2026

Home

❯

Machine Learning

Machine Learning

Jan 05, 20261 min read

Model building blocks

  • Dish Activation
  • Attention Mechanisms
  • Logit Softcapping

Multi-GPU

  • Model Parallelism:
    • Tensor Parallelism

Quantization

  • Quantizer Notation
  • GPTQ Checkpoint Format

    Graph View

    • Model building blocks
    • Multi-GPU
    • Quantization

    Created with Quartz v4.5.2 © 2026