aiwg

Version:

Cognitive architecture for AI-augmented software development with structured memory, ensemble validation, and closed-loop correction. FAIR-aligned artifacts, 84% cost reduction via human-in-the-loop, standards adopted by 100+ organizations.

aiwg.io

jmagly/aiwg

283 lines (201 loc) • 18.4 kB

Markdown

# REF-006: Cognitive Load During Problem Solving: Effects on Learning ## Citation Sweller, J. (1988). Cognitive load during problem solving: Effects on learning. *Cognitive Science*, 12(2), 257-285. **DOI**: [https://doi.org/10.1207/s15516709cog1202_4](https://doi.org/10.1207/s15516709cog1202_4) **Link**: [Wiley Online Library](https://onlinelibrary.wiley.com/doi/abs/10.1207/s15516709cog1202_4) **PDF**: `docs/references/pdfs/REF-006-sweller-1988-clt.pdf` ## Document Profile | Attribute | Value | |-----------|-------| | Pages | 29 | | Year | 1988 | | Venue | Cognitive Science | | Type | Experimental/Theoretical with Computational Model | | AIWG Relevance | **Critical** - Foundational theory for template design, documentation structure, and managing cognitive load in multi-agent workflows | ## Executive Summary John Sweller's seminal paper introduced Cognitive Load Theory (CLT), demonstrating that conventional problem-solving via means-ends analysis imposes such heavy cognitive load that it interferes with learning. The paper presents both theoretical analysis and experimental evidence showing that problem-solving and schema acquisition require different cognitive processes that compete for limited working memory capacity. Sweller distinguished between solving problems (which uses means-ends analysis) and learning from problems (which requires schema acquisition), showing these are often incompatible goals. The paper presents three types of evidence: (1) a computational model showing means-ends analysis requires 4 productions vs 1 for goal-free problems, (2) experimental data showing conventional problem solvers made 4-6x more mathematical errors than goal-free problem solvers, and (3) a dual-task experiment demonstrating that reducing cognitive load through goal-free problems significantly improved memory of problem structure (critical for schema acquisition). This work fundamentally challenged the assumption that problem-solving practice is an effective learning device, suggesting instead that worked examples and goal-free problems are superior for novice learners - a finding with profound implications for instructional design and, by extension, for designing AI-assisted development workflows. ## Key Findings ### 1. Schema Acquisition as Primary Factor in Expertise Sweller synthesized expert-novice research showing that domain-specific knowledge in the form of schemas distinguishes experts from novices across multiple dimensions: - **Memory of problem states**: Experts recall realistic problem configurations far better than novices (chess: De Groot, 1966; Chase & Simon, 1973; circuits: Egan & Schwartz, 1979; algebra: Sweller & Cooper, 1985) - **Problem-solving strategies**: Experts work forward from givens; novices use means-ends analysis working backward from goals (Larkin et al., 1980; Simon & Simon, 1978) - **Problem categorization**: Experts categorize by solution principles; novices by surface features (Chi, Glaser, & Rees, 1982; Hinsley, Hayes, & Simon, 1977) **Key insight**: "A schema is defined as a structure which allows problem solvers to recognize a problem state as belonging to a particular category of problem states that normally require particular moves" (p. 259). All three expert advantages stem from possessing domain-specific schemas. ### 2. Means-Ends Analysis Interferes with Learning Through maze problems (Sweller & Levine, 1982) and puzzle problems (Mawer & Sweller, 1982; Sweller, 1983), Sweller demonstrated that: - Subjects with **specific goals** solved problems successfully but **failed to learn essential problem structure** - Subjects with **nonspecific goals** (find any solution, not a particular one) **rapidly learned structural features** - When maze problems had no visible goal, forcing exploration rather than means-ends analysis, subjects learned structure 3-5x faster **Critical finding**: "While subjects had little difficulty solving these problems, they tended not to induce the relevant rules. This aspect of the problem structure could only be readily induced if considerable additional information was implicitly or explicitly provided" (p. 260). ### 3. Cognitive Load Mechanism: Competing Processes Sweller identified **two mechanisms** explaining why means-ends analysis blocks learning: **Selective Attention Mismatch**: - Means-ends analysis requires attending to: current state, goal state, differences between them, available operators, goal stack (if using subgoals) - Schema acquisition requires attending to: problem state patterns, operators associated with those states, relationships between states and moves - **These attention requirements overlap insufficiently**: "Previously used problem-solving operators and the relations between problem states and operators can be totally ignored by problem solvers using this strategy" (p. 261) **Limited Cognitive Processing Capacity**: - Means-ends analysis consumes most available working memory capacity - "The cognitive-processing capacity needed to handle this information may be of such a magnitude as to leave little for schema acquisition, even if the problem is solved" (p. 261) - Under some conditions, "a problem solver whose entire cognitive processing capacity is devoted to goal attainment... [is] attending to this aspect of the problem to the exclusion of those features necessary for schema acquisition" (p. 262) ### 4. Computational Model Evidence Sweller built a minimal production system model comparing cognitive load of means-ends vs. goal-free strategies on equation-chaining problems (kinematics, geometry, trigonometry): **Means-Ends Strategy** (4 productions): 1. If goal is only unknown in equation → solve for goal 2. If equation contains goal + unknowns → set unknowns as subgoals 3. If equation contains subgoals + unknowns → set new subgoals 4. If subgoal is only unknown in equation → solve for subgoal **Goal-Free Strategy** (1 production): 5. If equation has only one unknown → solve for that unknown **Results on 3-step problem** (Table 2, p. 272): | Measure | Means-Ends | Goal-Free | Ratio | |---------|-----------|-----------|-------| | Average working memory | 15.5 | 14.0 | 1.11:1 | | Peak working memory | 16 | 14 | 1.14:1 | | **Number of productions** | **4** | **1** | **4:1** | | Number of cycles | 5 | 3 | 1.67:1 | | **Total conditions matched** | **29** | **17** | **1.71:1** | **Interpretation**: The 4:1 production ratio and 1.71:1 conditions-matched ratio indicate means-ends analysis requires substantially more cognitive processing capacity. Since the model is minimal (adding realistic features would only increase complexity), this provides strong evidence for the cognitive overload hypothesis. ### 5. Experimental Validation: Mathematical Errors Owen and Sweller (1985) compared conventional vs. nonspecific-goal trigonometry problems matched for time (not number of problems). **Results**: Conventional problem solvers made **4-6 times more mathematical errors** (misuse of sine/cosine/tangent ratios) per side calculated than goal-free problem solvers. **Example** (Experiment 1): Conventional group averaged 1.2, 1.0, 0.3, 0.5, 0.5, 0.6 errors per problem; Nonspecific goal group averaged 1.2, 1.1, 0.25, 0.7, 0.75, 0.2 errors (Table 3, p. 280). **Explanation**: "Problem solvers organizing a problem according to means-ends principles suffer from a cognitive overload which leaves little capacity for other aspects of the task. This overload can be manifested by an increase in the number of mathematical errors made" (p. 276). ## Benchmark/Experimental Results ### Dual-Task Experiment (Current Paper) **Design**: 24 Year 10 students solved 6 trigonometry problems (conventional vs. nonspecific goal) while memorizing problem structure and solution for later reproduction. **Primary Task Performance** (Table 3): | Group | Problem 1 | Problem 2 | Problem 3 | Problem 4 | Problem 5 | Problem 6 | Mean | |-------|----------|----------|----------|----------|----------|----------|------| | **Conventional (sec)** | 274 | 239 | 170 | 188 | 193 | 138 | 200 | | **Goal-Free (sec)** | 272 | 227 | 156 | 170 | 191 | 126 | 190 | **Secondary Task Performance - Reproduction Errors** (Table 6, p. 281): | Error Type | Conventional | Goal-Free | Significance | |-----------|--------------|-----------|--------------| | Segment labels | 2.9 | 2.4 | ns | | Angle value | 3.6 | 3.4 | ns | | **Angle position** | **2.4** | **1.1** | **p < .05** | | **Side value** | **2.6** | **1.3** | **p < .05** | | **Side position** | **1.6** | **1.2** | **p < .05** | | **Solution** | **3.4** | **1.5** | **p < .001** | **Key finding**: Goal-free subjects showed **significantly better memory** for structural features (angle/side positions, solution steps) critical for schema acquisition, but **not** for surface features (segment labels, specific values) - exactly what the theory predicts. ### Nonspecific Goal Advantages (Multiple Studies) **Sweller, Mawer, & Ward (1983)** - Physics/geometry problems with goal "Calculate the value of as many variables as you can" vs. specific goals: - Goal-free group developed expertise more rapidly - Transferred better to subsequent problems - Made fewer errors on later problems **Sweller & Levine (1982)** - Maze problems: - **Specific goal group**: Failed to induce structural features, prevented from solving simple problems - **Nonspecific goal group**: Rapid learning of essential structural characteristics ### Production System Validation **Example problem**: "A car starts from rest and accelerates uniformly at 2 m/s² in a straight line with average velocity of 17 m/s. How far has it travelled?" **Equations**: s=vt, v=.5V, V=at **Means-ends solution** (5 cycles): 1. Find equation with goal s → sets t as subgoal (Prod. 2) 2. Find equation with subgoal t → sets V as subgoal (Prod. 3) 3. Solve v=.5V for V (Prod. 4) 4. Solve V=at for t (Prod. 4) 5. Solve s=vt for s (Prod. 1) **Goal-free solution** (3 cycles): 1. Solve v=.5V for V (Prod. 5) 2. Solve V=at for t (Prod. 5) 3. Solve s=vt for s (Prod. 5) Both reach same answer; means-ends requires 67% more cycles and 4x more production rules. ## Key Quotes for Citation > "Considerable evidence indicates that domain specific knowledge in the form of schemas is the primary factor distinguishing experts from novices in problem-solving skill. Evidence that conventional problem-solving activity is not effective in schema acquisition is also accumulating." (p. 257) > "A major reason for the ineffectiveness of problem solving as a learning device, is that the cognitive processes required by the two activities overlap insufficiently, and that conventional problem solving in the form of means-ends analysis requires a relatively large amount of cognitive processing capacity which is consequently unavailable for schema acquisition." (p. 257) > "The cognitive-processing capacity needed to handle this information may be of such a magnitude as to leave little for schema acquisition, even if the problem is solved." (p. 261) > "Solving a problem and acquiring schemas may require largely unrelated cognitive processes." (p. 261) > "Conventional problem solving in the form of means-ends analysis, while facilitating problem solution, could frequently prevent problem solvers from learning essential aspects of a problem's structure." (p. 260) > "Cognitive load theory suggests that effective instructional material facilitates learning by directing cognitive resources toward activities that are relevant to learning rather than toward preliminaries to learning." (p. 103 - from related 1998 paper referenced) > "The results provide no evidence that an increased load during conventional problem solving assists problem solvers in assimilating information concerning the initial problem structure or the solution steps." (p. 282) ## AIWG Implementation Mapping | CLT Principle | AIWG Implementation | |---------------|---------------------| | **Worked Example Effect** | Extensive template library providing scaffolded structures rather than blank documents; SDLC framework provides 50+ worked example templates | | **Split-Attention Effect** | Integrated documentation keeping context together; single-file agent definitions with capabilities, tools, and examples co-located; @-mention system for inline cross-references | | **Redundancy Effect** | Agent personas avoid redundant explanations; templates include only essential sections; phase gate checklists focus on must-have criteria | | **Goal-Free Effect** | Ralph loop uses completion criteria (state-based) rather than procedural goals; agents given exploration permission before converging on solution | | **Modality Principle** | Multi-format documentation (markdown, diagrams, code examples); visual workflow diagrams complement textual descriptions | | **Element Interactivity** | Complex SDLC decomposed into phases (Inception→Elaboration→Construction→Transition→Production); each phase has isolated concerns | | **Expertise Reversal** | Multi-tier documentation: Quick Start (novices) → Developer Guide (intermediates) → Reference (experts); progressive disclosure of complexity | ### Specific AIWG Design Decisions Informed by CLT **1. Template-First Approach**: - Rather than asking agents to "write requirements document following IEEE 830," provide `use-case-template.md` with placeholders - Reduces extraneous load by eliminating need to recall/discover document structure - Based on worked example effect: studying examples > solving equivalent problems **2. Integrated Context Management**: - `.aiwg/` directory co-locates all artifacts (requirements, architecture, tests) - Cross-references use inline @-mentions rather than separate appendices - Example: `@implements @.aiwg/requirements/UC-001.md` in code files - Eliminates split-attention between separate documents **3. Phase-Based Decomposition**: - SDLC divided into 5 phases, each with distinct cognitive load profile - Inception: Broad exploration (low intrinsic load) - Elaboration: Architecture decisions (high intrinsic load, minimize extraneous) - Reduces element interactivity by temporal separation of concerns **4. Agent Specialization**: - 58 agents with bounded expertise vs. 1 general agent - Security Auditor doesn't need to hold full application context in working memory - Maps to CLT principle: partition complex domains into manageable schemas **5. Progressive Disclosure**: - Quick Start: 5-minute setup (novice-optimized, high guidance) - Full Guide: Complete workflows (intermediate, moderate guidance) - Reference: All 31 CLI commands (expert, minimal guidance) - Implements expertise reversal effect: what helps novices may burden experts **6. Checklist Design**: - Phase gate criteria limited to 5-7 per category (respects Miller's 7±2 and reduces load) - Hierarchical grouping prevents cognitive overload from long flat lists - Example: Test Strategy template has 6 top-level sections, each with ≤7 subsections **7. Ralph Loop Design**: - **Completion criteria** (state: "npm test passes") rather than procedural goals (steps: "fix Test 1, then Test 2...") - Reduces cognitive load by eliminating need for subgoal management - Allows agent to explore solution space without means-ends analysis overhead - Direct application of Sweller's goal-free effect findings ## Cross-References | Reference | Relationship | |-----------|--------------| | **REF-005** (Miller's Law) | Provides working memory capacity foundation (7±2 items) that CLT builds upon; chunking concept underlies schema formation | | **REF-007** (Mixture of Experts) | Multi-agent architecture reduces per-agent cognitive load by bounded expertise; aligns with CLT's domain partitioning | | **REF-010** (Stage-Gate) | Phase gates provide cognitive "rest points" where load resets; staged progression prevents cumulative overload | | **REF-011** (Requirements Traceability) | @-mention wiring reduces split-attention load; integrated traceability supports schema formation | ## Quick Reference Locations | Topic | Location | |-------|----------| | **Expert-novice distinctions** | Pages 258-259 | | **Memory of problem states** | Page 258 | | **Problem-solving strategies** | Pages 258-259 | | **Problem categorization** | Page 259 | | **Schema definition** | Page 259 | | **Learning interference evidence** | Pages 260-261 | | **Selective attention mechanism** | Page 261 | | **Cognitive capacity mechanism** | Pages 261-262 | | **Forward-working strategies** | Page 262 | | **Nonspecific goal advantages** | Pages 262-263 | | **Computational model overview** | Pages 263-265 | | **Production system details** | Pages 265-270 | | **Means-ends productions** | Pages 266-269, Table 1 | | **Goal-free production** | Pages 270-271, Table 1 | | **Cognitive load measures** | Pages 272-273, Table 2 | | **Experimental method** | Pages 278-279 | | **Dual-task results** | Pages 279-282, Tables 3-6 | | **Error analysis** | Pages 275-276, 281 | | **Theoretical implications** | Pages 283-284 | | **Educational applications** | Pages 283-284 | ## Related Work **Foundational Papers**: - Miller, G. A. (1956). The magical number seven, plus or minus two (REF-005) - Chase, W., & Simon, H. (1973). Perception in chess - Chi, M., Glaser, R., & Rees, E. (1982). Expertise in problem solving **Follow-Up CLT Research**: - Sweller, J., van Merriënboer, J. J., & Paas, F. G. (1998). Cognitive architecture and instructional design - Kalyuga, S. (2007). Expertise reversal effect and its implications for learner-tailored instruction - Chandler, P., & Sweller, J. (1991). Cognitive load theory and the format of instruction **Application to AI Systems**: - Modern prompt engineering reflects CLT principles (few-shot examples = worked examples) - Context window management in LLMs directly addresses cognitive load constraints - AIWG's multi-agent patterns implement CLT at system architecture level ## Revision History | Date | Author | Changes | |------|--------|---------| | 2026-01-24 | Research Acquisition (#74) | Comprehensive update from full paper review - added complete experimental results, computational model details, dual-task experiment, detailed implementation mapping, extensive quotes, and production system analysis |