
  • Exploring Hidden Dimensions in Llama-3.2-3B


    Llama 3.2 3B fMRI LOAD BEARING DIMS FOUND

    A local interpretability toolchain has been developed to explore how hidden dimensions couple in small language models, specifically Llama-3.2-3B-Instruct. By using deterministic decoding and stratified prompts, the toolchain reduces noise and identifies the dimensions that most strongly influence model behavior. A causal test showed that perturbing one critical dimension, DIM 1731, causes a collapse in semantic commitment while fluency is maintained, which suggests the dimension plays a role in decision stability. The result points to high-centrality dimensions that the model depends on to function, and it opens the way to further exploration and to replication across models. Understanding these dimensions matters for improving the reliability and interpretability of AI models.
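
    The causal test described above can be illustrated with a small, self-contained sketch. This is not the toolchain's code: it replaces the real model with a toy random readout matrix, and the helper names (`perturb`, `logits`, `top1_margin`), the use of the top-1 versus top-2 logit margin as a rough stand-in for "semantic commitment", and the weight deliberately planted on dimension 1731 are all assumptions made for illustration.

```python
import random

HIDDEN_SIZE = 3072  # hidden width of Llama-3.2-3B
TARGET_DIM = 1731   # the dimension named in the summary
TOY_VOCAB = 8       # hypothetical tiny vocabulary for the toy readout


def perturb(hidden, dim, scale):
    """Return a copy of `hidden` with coordinate `dim` multiplied by `scale`."""
    out = list(hidden)
    out[dim] *= scale
    return out


def logits(hidden, readout):
    """Project a hidden vector onto each row of a readout matrix."""
    return [sum(w * h for w, h in zip(row, hidden)) for row in readout]


def top1_margin(scores):
    """Gap between the best and second-best score (assumed commitment proxy)."""
    top, second = sorted(scores, reverse=True)[:2]
    return top - second


rng = random.Random(0)  # fixed seed, mirroring the deterministic setup

# Toy hidden state: small noise everywhere, one large "load-bearing" value.
hidden = [0.01 * rng.gauss(0.0, 1.0) for _ in range(HIDDEN_SIZE)]
hidden[TARGET_DIM] = 50.0

# Toy readout: random weights, but only token 0 reads from TARGET_DIM.
readout = [[rng.gauss(0.0, 1.0) for _ in range(HIDDEN_SIZE)]
           for _ in range(TOY_VOCAB)]
for token_id, row in enumerate(readout):
    row[TARGET_DIM] = 1.0 if token_id == 0 else 0.0

baseline_margin = top1_margin(logits(hidden, readout))
ablated = perturb(hidden, TARGET_DIM, 0.0)  # zero out the single dimension
ablated_margin = top1_margin(logits(ablated, readout))

print(f"baseline margin: {baseline_margin:.3f}")
print(f"ablated margin:  {ablated_margin:.3f}")
```

    In this toy setup the margin shrinks sharply once the planted dimension is zeroed, because the remaining scores come only from small noise. On the real model the same intervention would have to be applied to a layer's activations during a forward pass, and whether the margin is a good measure of commitment there is an open assumption.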
