Common AI Mistakes On HPC Systems

Be especially skeptical when the model:

Invents sbatch or srun flags
Confuses login-node work with compute-node work
Assumes pip install is always the right move
Guesses the wrong module names or versions
Mixes up $HOME, $SCRATCH, and project storage
Suggests an MPI launch pattern that does not match your code
Assumes GPU access without requesting GPU nodes through the right Slurm constraints

These are common failure modes, not rare edge cases.
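
One way to act on the first item in the list above is to check a generated batch script mechanically before submitting it. The following is a minimal sketch, not a NERSC-provided tool: the function name `unknown_sbatch_options` and the short allowlist are assumptions made for illustration, and `--gpu-count` is a deliberately made-up option standing in for something a model might invent. The allowlist is incomplete by design; the authoritative source is `sbatch --help` or the Slurm documentation on the system you are using.

```python
import re

# Illustrative allowlist only: a few long-form sbatch options that exist in Slurm.
# It is NOT complete; extend it from `sbatch --help` on your system.
KNOWN_SBATCH_OPTIONS = {
    "--account", "--constraint", "--cpus-per-task", "--error", "--gpus",
    "--gpus-per-node", "--job-name", "--nodes", "--ntasks",
    "--ntasks-per-node", "--output", "--qos", "--time",
}

# Matches lines such as "#SBATCH --nodes=1" and captures the option name.
DIRECTIVE = re.compile(r"^#SBATCH\s+(--[A-Za-z0-9-]+)")


def unknown_sbatch_options(script_text):
    """Return (line_number, option) pairs for long options not in the allowlist."""
    flagged = []
    for lineno, line in enumerate(script_text.splitlines(), start=1):
        match = DIRECTIVE.match(line.strip())
        if match and match.group(1) not in KNOWN_SBATCH_OPTIONS:
            flagged.append((lineno, match.group(1)))
    return flagged


# Hypothetical AI-drafted script; "--gpu-count" is invented for this example.
example_script = """#!/bin/bash
#SBATCH --nodes=1
#SBATCH --constraint=gpu
#SBATCH --gpu-count=4
#SBATCH --time=00:10:00
srun ./my_app
"""

print(unknown_sbatch_options(example_script))
```

This sketch only inspects long-form `#SBATCH` directives; short options such as `-N` and flags passed to `srun` are not checked. A flagged option is not necessarily wrong, since the allowlist is partial, so treat it as a prompt to look the option up.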

Vibe Coding at NERSC

Practical AI-Assisted Coding for NERSC Users

What Is Vibe Coding

And What It Is Not

Vibe Coding

What Is Agentic AI

Chatbot vs Coding Agent

Why It Matters

For NERSC Users

Why This Matters at NERSC

Good First Use Cases

Less Suitable Use Cases

How It Works

The Building Blocks

Building Blocks

Why Tool Use Matters

Why CLI Agents Matter

Beyond Basic Chat

RAG, MCP, and Skills

RAG In Practice

MCP Servers

Skills And Reusable Workflows

Coding Assistants

What the Tooling Looks Like

Types of Coding Assistants

Installing a Coding Agent

Start Small

Best Practices

Getting Better Results

Good Engineering Helps Good AI

Context Engineering

Prompting That Works

Weak Prompt vs Strong Prompt

Weak

Strong

A Practical Workflow

Bigger Projects

Multi-Agent Use

Subagents

Why Subagents Help

Good Subagent Patterns

Safety and Security

Keep Humans In Charge

Human Responsibilities Do Not Go Away

Security Considerations

Sandboxing

Why Sandboxing Matters

Common Sandbox Modes

Sandbox Design Principles

Open Sandbox Options

Common Failure Modes

Verification Is The Whole Game

Practical HPC Examples

Where This Helps

Example: Drafting A Slurm Script

Example: A Perlmutter GPU Job

Example: Debugging A Failed Job

Example: Debugging With sacct

Example: Converting A Workflow

Example: Scaling A Training Workflow

What Makes That Example Useful

Example: Documentation Acceleration

Common AI Mistakes On HPC Systems

Slurm And Module Advice Needs Verification

Prompt Pattern For NERSC Tasks

Other Useful Command Line Tools

Closing Thought

Capability + Control

Closing Thought

If You Only Remember Four Things

References


Thank you!

Questions?

Example: Debugging With `sacct`