gsnn.models.GroupRMSNorm
Group-wise Root Mean Square Layer Normalization.
RMSNorm is a simpler alternative to layer normalization that only uses the RMS for normalization without mean centering. It’s particularly stable for small batch sizes and computationally more efficient than layer norm.
Classes
|
Applies Root Mean Square normalization within each channel group. |
- class gsnn.models.GroupRMSNorm.GroupRMSNorm(*args: Any, **kwargs: Any)[source]
Bases:
ModuleApplies Root Mean Square normalization within each channel group.
RMSNorm normalizes using only the RMS (root mean square) without mean centering, making it simpler and more stable than layer normalization, especially for small batch sizes.
- Parameters: