A Binary Neural Network (BNN) is a neural network where both the weights and activations are constrained to binary values, typically +1 and -1 (or 1 and 0). This radical form of quantization replaces standard 32-bit floating-point operations with highly efficient bitwise XNOR and popcount operations, drastically reducing memory footprint and computational cost. The primary goal is to enable the deployment of complex models on resource-constrained edge devices like microcontrollers and smartphones, where power and memory are limited.
