# VLEO-DSMC **Repository Path**: spacecube/VLEO-DSMC ## Basic Information - **Project Name**: VLEO-DSMC - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2026-06-03 - **Last Updated**: 2026-06-03 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README Updated May 19 2026 # VLEO-DSMC: Satellite Atmospheric Simulation with SPARTA VLEO-DSMC is a simulation toolkit built on [SPARTA](https://sparta.github.io/) for analyzing spacecraft aerodynamics across flow regimes in **Very Low Earth Orbit (VLEO)** using the Direct Simulation Monte Carlo (DSMC) method. It integrates real NRLMSIS-2.1 atmospheric data to model the full thermospheric composition (N₂, O₂, O, He, Ar, N) with altitude-dependent species fractions from 70–500 km. Import your satellite geometry as an STL file and compute aerodynamic drag, surface heating, velocity fields, and streamlines. The included Python scripts handle atmospheric data generation, dump file handling, and visualization. Particle animations, temperature heatmaps, flow field plots, and multi-altitude drag analysis are all supported out of the box. MPI parallelization enables faster execution on multi-core systems.

## Table of Contents **For a quick start, follow along with sections 1-5:** 0. [Simulation Overview](#simulation-overview) 1. [Installing SPARTA](#1-install-sparta) 2. [Setting Up Python Environment](#2-set-up-python-environment) 3. [Running Simulations](#3-run-simulations) 4. [Converting Dump Data](#4-convert-dump-data-for-memory-efficient-analysis) 5. [Visualization and Analysis Scripts](#5-visualization-and-analysis-scripts) 6. [Atmospheric Data and Species Composition](#6-atmospheric-data-and-species-composition) 7. [Surface Geometry and STL Conversion](#7-surface-geometry-and-stl-conversion) 8. [Input File Configurations](#8-input-file-configurations) 9. [Drag Calculation Methods](#9-drag-calculation-methods) 10. [Best Practices](#11-best-practices) 11. [Running on Penn State Roar Supercomputer](#11-running-on-roar-supercomputer) ## 0. Simulation Overview ### What is DSMC? **Direct Simulation Monte Carlo (DSMC)** is a computational method for simulating rarefied gas flows where the continuum assumption breaks down. Instead of solving fluid equations, DSMC tracks individual representative particles (each particle is really a cluster of many, many actual particles) and models molecular collisions probabilistically. This simulation toolkit uses SPARTA DSMC to render continuum, transition, and free molecular atmospheric flows around an orbiting spacecraft in VLEO using real NRLMSIS atmospheric data. ### How It Works The following parameters are shown for a ~30 minute simulation on my laptop (AMD Ryzen 9 5900HS, 8 cores/16 threads, 40GB RAM). Simulations in the free-molecular regime can be run on this setup. For analysis into the continuum regime, a cluster may be required, since a shorter mean-free path requires higher resolution to accurately simulate. See [Running on Penn State Roar Supercomputer](#11-running-on-roar-supercomputer) for more details. You can test run the program and it will immediately output what the maximum cell size and timesteps can be: (example) ``` CELL SIZE MUST BE < 0.01 m TIMESTEP MUST BE < 1*10^-6 s ``` For lower fidelity, decrease grid dimensions, number of particles, and time steps, but make sure that constraints are followed for computational accuracy. #### Physical Domain - **3D Cartesian domain:** 2.2m × 2.2m × 2.2m cube (±1.1m in each direction) - **Boundary conditions:** - X-direction: Outflow (gas escapes at +X boundary, injected at -X) - Y,Z-directions: Periodic (wraparound) - **Grid resolution:** 350 × 200 × 50 cells - **Cell size:** λ/3 (one-third of mean free path) for accurate collision modeling #### Atmospheric Modeling The simulation integrates real atmospheric data using the **NRLMSIS-2.1 empirical model**: - **Density (ρ):** Mass density at specified altitude - **Number density (nrho):** Molecular concentration - **Temperature (T):** Atmospheric temperature - **Bulk velocity (vx):** Free-stream velocity (orbital speed at given altitude) **Supported altitude range:** 70-500 km (thermosphere/mesopause region) - NRLMSIS data extends to 1000 km, but current species data is most accurate for lower thermosphere - Full atmospheric composition is now supported: N₂, O₂, O, He, Ar, N (see [Atmospheric Data](#4-atmospheric-data-and-species-composition)) #### Particle Representation - **Target particles:** 500,000–2,000,000 computational particles (configurable via `Ns_target`) - **Species:** Full NRLMSIS composition (N₂, O₂, O, He, Ar, N with altitude-dependent fractions) - **Weighting factor:** Each computational particle represents ~10¹⁰-10¹² real molecules - **Injection:** Continuous inflow of atmospheric gas at domain boundary #### Collision Physics **Variable Soft Sphere (VSS) Model:** - **Probabilistic collisions:** Particles don't physically collide; instead collision probability is calculated based on: - Molecular cross-sections (σ ≈ 3.7×10⁻¹⁰ m diameter) - Relative velocities - Local density - **Mean free path:** λ = kT/(√2πd²ρR) ≈ meters at high altitude - **Collision frequency:** Determined by kinetic theory and local gas properties - **VSS parameters:** Defined in `vss/air.vss` for all supported species (VSS = Variable Soft Sphere) #### Surface Interactions **Diffuse Surface Model:** - **Surface geometry:** 3D satellite model (STL → surface mesh) - **Temperature coupling:** Stefan-Boltzmann radiation (ε = 0.9, T₀ = 300K) - **Molecular accommodation:** Gas molecules thermalize with surface temperature - **Energy transfer:** Tracks kinetic + internal energy flux to surface #### Time - **Timestep:** 1×10⁻⁷ seconds (much smaller than collision time) - **Duration:** 10,000 timesteps (1.0 milliseconds physical time) - **Diagnostics:** Data output every 100 timesteps (100 frames total) #### Scientific Applications This simulation models **satellite atmospheric drag and heating** in the thermosphere, relevant for: - Low Earth Orbit (LEO) satellite design - Atmospheric re-entry analysis - Hypersonic vehicle aerothermodynamics - Spacecraft surface temperature prediction The DSMC method is essential at altitude ranges where the Knudsen number (Kn = λ/L > 0.1) indicates rarefied flow conditions that may violate continuum assumptions used in traditional CFD. ## 1. Install SPARTA ### Prerequisites ```bash sudo apt update sudo apt install build-essential gfortran mpich ``` ### Build SPARTA ```bash git clone https://github.com/sparta/sparta.git cd sparta/src # Build MPI version (recommended for multi-core execution) make mpi # Creates: spa_mpi # Build serial version (single core only) make serial # Creates: spa_serial ``` **Which to use?** Build `spa_mpi` if you want to run simulations on multiple cores (recommended). The scripts in this repo (`run_sparta.sh`, `multi_altitude.py`) use `mpirun` and require `spa_mpi`. ### Create a symlink To use `sparta` as the command name (as used in `run_sparta.sh` and `multi_altitude.py`): ```bash cd ~/sparta/src ln -s spa_mpi sparta # Recommended: enables multi-core runs with mpirun ``` Or for serial only (not recommended): ```bash cd ~/sparta/src ln -s spa_serial sparta ``` **Important:** You will likely use the `run_sparta.sh` script, which utilizes `mpirun` for parallel execution. You **must** link to `spa_mpi` (first code snippet above) to run this. Using `spa_serial` with `mpirun` will launch multiple independent serial processes instead of one parallel simulation; they won't communicate and results will be incorrect. ### Add to PATH For **bash** (most common): ```bash echo 'export PATH="$PATH:$HOME/sparta/src"' >> ~/.bashrc source ~/.bashrc ``` For **tcsh/csh**: ```tcsh echo 'setenv PATH "${PATH}:${HOME}/sparta/src"' >> ~/.cshrc source ~/.cshrc ``` ### Verify installation ```bash sparta -h # Should print help/version info # OR test directly: ~/sparta/src/sparta -h # Verify MPI is working (should say "Running on N MPI task(s)"): mpirun -np 2 sparta -h # Ignore the 'non-zero status' error that comes after this. ``` If `sparta -h` doesn't work but the direct path does, your PATH isn't set correctly for your shell. **Additional SPARTA documentation:** Extensive documentation is available at https://sparta.github.io/ - I highly recommend perusing it; it isn't terribly long, it can be tremendously helpful, and getting an idea about what is included before you start will save you hours of troubleshooting! ## 2. Set Up Python Environment ```bash cd ~/AMPT python3 -m venv .venv source .venv/bin/activate pip install -r requirements.txt ``` You must run `source .venv/bin/activate` **every time you open a new terminal** (unless you're using system-wide Python via `apt`, which doesn't require activation). I have it set up like this to isolate dependencies and guarantee it will run on any machine with just requirements.txt To auto-activate in **VS Code** (not necessary): - Press `Ctrl+Shift+P` - Type `Python: Select Interpreter` - Choose `.venv/bin/python` from the list ## 3. Running Simulations **!! NOTE:** `run_sparta.sh` and all analysis scripts expect the input file to be named `in.ampt`. To run a simulation, copy your chosen configuration to `in.ampt` first. Example: `cp in.general_surface in.ampt` ### Single Altitude Simulation ```bash # Generate atmospheric data for specific altitude (70-500 km) python3 tools/load_atm_data.py 150 # km # Run SPARTA simulation (single core) sparta < in.ampt # OR run with multiple cores for faster execution ./run_sparta.sh # Uses 8 cores by default. Modify this to work with your setup. ``` **Performance:** Multi-core execution is typically 4-6x faster for DSMC simulations. ### Multi-Altitude Analysis ```bash # Single core (default) python3 multi_altitude.py # Multi-core parallel execution (faster) python3 multi_altitude.py --cores 8 python3 multi_altitude.py -c 4 # Edit altitude list in multi_altitude.py ``` This will: - Automatically run SPARTA simulations at multiple altitudes using `in.ampt` - Save results to `dumps/alt_XXkm/` directories - **Automatically convert dumps to Parquet format** for memory-efficient analysis **Performance:** Using `--cores 8` is typically 4-6x faster than single core for DSMC simulations. **Important:** `multi_altitude.py` expects the input file to be named `in.ampt`. To run the script, copy your chosen configuration to `in.ampt` first. Example: `cp in.general_surface in.ampt` ### Analyze Multi-Altitude Results After running multi-altitude simulations: ```bash # Analyze drag vs altitude (animated plot + CSV export) python3 scripts/analyze_multi_altitude_drag.py # Export CSV only (no plot) python3 scripts/analyze_multi_altitude_drag.py --csv # Analyze surface temperature vs altitude (animated plot) python3 scripts/analyze_multi_altitude_temp.py ``` Output files: - `outputs/drag_vs_altitude.csv` - Total drag vs altitude data - `outputs/ram_drag_vs_altitude.csv` - Ram drag component (if using cube geometry) - `outputs/skin_friction_vs_altitude.csv` - Skin friction component (if using cube geometry) - `outputs/multi_altitude_drag_evolution.mp4` - Animated drag plot (linear scale) - `outputs/multi_altitude_drag_evolution_log.mp4` - Animated drag plot (log scale) - `outputs/multi_altitude_temp_evolution.mp4` - Animated temperature plot ## 4. Convert Dump Data for Memory-Efficient Analysis After running simulations, convert dump data to Parquet format for memory-efficient analysis: ```bash # Convert dumps from default directory (dumps/) python3 tools/load_dumps.py # Convert dumps from specific directory python3 tools/load_dumps.py dumps/alt_XXkm/ ``` This will: - Parse raw dump files (part.*.dat, grid.*.dat, surf.*.dat) - Save memory-efficient Parquet files (.parquet) in the same directory (uses significantly less RAM than pickle files) - Can handle large (100 GB+) datasets without crashes - Python analysis scripts automatically use Parquet files for streaming data access Large SPARTA simulations can generate GB of particle data. The old pickle format was simpler in that it generated a single usable file for each set of dumps, but would cause RAM crashes when loading entire datasets. Parquet enables streaming access (i.e. loading only one timestep at a time) which reduces memory usage. ### Parquet Loading Functions For use in custom analysis scripts: ```python from tools.load_dumps import load_parquet_timesteps, load_parquet_single # Get list of available timesteps without loading data timesteps = load_parquet_timesteps("particle", "dumps/alt_100km") # Load a single timestep (memory efficient) step, df, box = load_parquet_single("particle", timesteps[0], "dumps/alt_100km") ``` ## 5. Visualization and Analysis Scripts All visualization scripts are located in `scripts/` and output to `outputs/`. Most scripts default to reading from `dumps/` but can analyze specific altitude data. ### Particle Animation ```bash python3 scripts/animate_particles.py [folder] ``` - Creates 3D scatter plot animation of particle positions - Subsamples to 5000 particles for performance - **Output:** `outputs/particle_anim.mp4`

### Surface Temperature Heatmap ```bash python3 scripts/surface_temp_heatmap.py [folder] ``` - Animated 3D visualization of surface temperatures with energy flux annotations - Rotating view with colorbar - **Output:** `outputs/surface_temp_heatmap.mp4` - **Note:** Uses `in.ampt` to extract timestep size

### Grid Temperature Heatmap ```bash python3 scripts/grid_temp_heatmap.py [folder] ``` - 2D heatmap animation of gas temperature in a horizontal slice (|z| ≤ 5% of domain height) - Uses native grid resolution from SPARTA - **Output:** `outputs/grid_temp_heatmap.mp4` - **Note:** Uses `in.ampt` to extract timestep size

### Velocity Heatmap ```bash python3 scripts/velocity_heatmap.py [folder] ``` - 2D heatmap animation of particle speed in a horizontal slice - Uses 500×300 binning resolution - **Output:** `outputs/velocity_heatmap.mp4` - **Note:** Uses `in.ampt` to extract timestep size

### Streamlines ```bash # Static snapshot (final timestep) python3 scripts/streamlines.py # Animated streamlines python3 scripts/streamlines.py --anim ``` - Visualizes flow field with streamlines overlaid on speed heatmap - Uses flow dump data (per-cell velocity averages) - Red rectangle indicates object outline - **Output:** `outputs/streamlines_2d.png` or `outputs/streamlines_anim.mp4`

_{Static Streamlines}

_{Animated Streamlines}

### Drag Analysis (Single Run) ```bash python3 scripts/plot_drag.py [--show] [--out path.png] [--csv out.csv] ``` - Plots drag vs timestep from `dumps/direct_drag.dat` - See [Drag Calculation Methods](#9-drag-calculation-methods) for details - **Output:** `outputs/drag.png` (default) ### Multi-Altitude Drag Analysis ```bash python3 scripts/analyze_multi_altitude_drag.py [--csv] ``` - Analyzes drag across multiple altitudes from `dumps/alt_*km/` directories - Creates animated plots showing drag components vs altitude over time - **Output:** - `outputs/drag_vs_altitude.csv` - `outputs/multi_altitude_drag_evolution.mp4` (linear scale) - `outputs/multi_altitude_drag_evolution_log.mp4` (log scale)

_{Linear Scale}

_{Log Scale}

### Multi-Altitude Temperature Analysis ```bash python3 scripts/analyze_multi_altitude_temp.py ``` - Analyzes surface temperatures across multiple altitudes - Creates animated plot showing temperature vs altitude over time - **Output:** `outputs/multi_altitude_temp_evolution.mp4`

### Usage Examples ```bash # Default (uses dumps/ folder) python3 scripts/animate_particles.py python3 scripts/surface_temp_heatmap.py python3 scripts/grid_temp_heatmap.py python3 scripts/velocity_heatmap.py # Analyze specific altitude results python3 scripts/animate_particles.py dumps/alt_80km python3 scripts/surface_temp_heatmap.py dumps/alt_100km python3 scripts/grid_temp_heatmap.py dumps/alt_75km python3 scripts/velocity_heatmap.py dumps/alt_95km ``` **Note:** - Run `python3 tools/load_dumps.py ` first to convert dump data to Parquet format - Scripts now use streaming data access, preventing RAM crashes on large datasets - no more catastrophic failure :) - All output files (.mp4, .png, .csv) are saved to the `outputs/` folder ## 6. Atmospheric Data and Species Composition ### How Atmospheric Data is Generated and Used The atmospheric data pipeline works as follows: 1. **Generate NRLMSIS data:** Run `python3 tools/load_atm_data.py ` to query NRLMSIS-2.1 for atmospheric properties at the specified altitude. 2. **Data files created in `data/`:** - `nrlmsis.dat` - Full NRLMSIS dataset (70-500 km) with columns: - Altitude (km), Temperature (K), Density (kg/m³), Pressure (Pa), Pressure (Torr), Orbital Velocity (m/s), Number Density (m⁻³), n_N2, n_O2, n_O, n_He, n_Ar, n_N - `atm.sparta` - SPARTA include file with all atmospheric variables, species, and mixture definitions 3. **Contents of `data/atm.sparta`:** This file is auto-generated with all atmospheric data for the specified altitude: ``` # NRLMSIS-2.1 atmospheric data for 150 km altitude # Generated by load_atm_data.py variable rho equal 1.234567e-09 variable nrho equal 2.345678e+16 variable T equal 634.123456 variable vx equal 7814.2 species species/air.species N2 O2 O He Ar N mixture atm N2 frac 0.5379 mixture atm O2 frac 0.0379 mixture atm O frac 0.4213 mixture atm He frac 0.0005 mixture atm Ar frac 0.0009 mixture atm N frac 0.0015 ``` (Example shown for ~150 km altitude; values vary with altitude) 4. **Loading in SPARTA:** The input files simply include the file: ``` include data/atm.sparta mixture atm nrho ${nrho} vstream ${vx} 0.0 0.0 temp ${T} ``` ### Species Data Files - **`species/air.species`** - Molecular properties (mass, rotational/vibrational DOF, etc.) for N₂, O₂, O, N, He, Ar, and ionized species - **`vss/air.vss`** - Variable Soft Sphere collision parameters (diameter, omega, tref, alpha) for each species ## 7. Surface Geometry and STL Conversion ### Converting STL Files to SPARTA Surface Format SPARTA requires surface files in its native `.surf` format. Use `stl2surf.py` to convert: ```bash python3 tools/stl2surf.py models/your_model.STL surf/your_model.surf ``` **Features:** - Accepts both ASCII and binary STL files - Automatically converts mm → m (divides coordinates by 1000) - Auto-centers the model at the origin - Warns if the surface is not watertight (important for closed bodies) **Example:** ```bash python3 tools/stl2surf.py models/AMPT_sat_inlet.STL surf/AMPT_sat_inlet.surf ``` ### Available Surface Files Located in `surf/`: - `cube.surf` - 1m × 1m × 1m cube centered at origin (12 triangles) - Additional satellite configurations ## 8. Input File Configurations ### Overview The repository provides several input file templates for different simulation scenarios: **Available Templates:** | File | Description | Use Case | |------|-------------|----------| | `in.cube` | Cube geometry with auto-generated surface groups | Validation, ram vs skin friction studies | | `in.general_surface` | General geometry with uniform accommodation | Production simulations with complex geometry | | `in.auto_surf_decomp` | Automatic surface decomposition by orientation | Complex geometries requiring drag decomposition | | `in.ampt_box` | Original slender box configuration | Testing | **To run any of them, copy/rename to `in.ampt`** since all run / analysis scripts expect this filename: ```bash cp in.cube in.ampt # Use cube configuration cp in.general_surface in.ampt # Use general surface configuration cp in.auto_surf_decomp in.ampt # Use automatic decomposition ``` --- **`in.cube` (Cube with Different Face Treatments):** This configuration is used for studying ram drag vs skin friction separately on a cube geometry: ``` # Surface groups for different face orientations read_surf surf/cube.surf group ampt group ampt_xnorm surf id 1:4 # front/back faces (ram direction) group ampt_yznorm surf id 5:12 # side walls # Define collision models with different accommodation surf_collide wall_diffuse diffuse s_Tsurf 1 # α=1.0 for ram faces surf_collide wall_specular diffuse s_Tsurf 1 # α=1.0 for side walls # Apply to respective surface groups surf_modify ampt_xnorm collide wall_diffuse surf_modify ampt_yznorm collide wall_specular ``` [Jiang et al.](https://doi.org/10.1016/j.ast.2022.108077) uses this cube geometry, and tests both α=0.3 and α=1.0 (see below) for lateral surfaces. Just like in the paper, this script allows separate calculation of: - **Ram drag:** Force on x-normal faces (directly facing the flow) - **Skin friction:** Force on y/z-normal faces (parallel to flow) ### Accommodation Coefficients The accommodation coefficient (α) in the collision models determines how much a gas molecule thermalizes with the surface: - **α = 1.0 (fully diffuse):** Molecule leaves surface at surface temperature with random direction - **α = 0.0 (fully specular):** Molecule reflects like a billiard ball, conserving tangential momentum - **0 < α < 1:** Partial accommodation (interpolated behavior) You can modify the α values in the input files by changing the parameter in the `surf_collide` commands. For example, changing `diffuse s_Tsurf 0.9` to `diffuse s_Tsurf 0.5` reduces accommodation from 90% to 50%. **`in.general_surface` (General Purpose - Uniform Accommodation):** For simulating arbitrary geometries where face-specific treatment is not needed: ``` surf_collide wall diffuse s_Tsurf 0.9 # Single collision model, α=0.9 surf_modify ampt collide wall # Apply to all surfaces ``` This is ideal if you plan to simply upload an STL file, convert it to surf, and run the simulation. **`in.auto_surf_decomp` (Automatic Surface Decomposition):** For complex geometries where you want to automatically identify and separate ram vs lateral surfaces: ``` # Automatically generate surface groups using normal vector analysis read_surf surf/your_geometry.surf group ampt include surf/your_geometry.sparta # auto-generated groups ``` Run `python tools/auto_surf_decomp.py surf/your_geometry.surf` to generate the `.sparta` file containing: - **ampt_xnorm:** Surfaces with normals pointing in -x direction (ram surfaces, nx < -0.9) - **ampt_yznorm:** Surfaces with normals perpendicular to x-axis (lateral surfaces, |nx| < 0.1) - **ampt_diffuse:** All remaining unclassified triangles (assigned diffuse collision model) The tool analyzes triangle normals and classifies surfaces by orientation, enabling drag decomposition on arbitrary geometries without manual surface ID assignment. Any triangles not matching the ram or lateral criteria are automatically assigned to the diffuse group with accommodation coefficient α = 0.9. This works best for surfaces with discrete ram/lateral facing triangles. **Recommendation:** - Use `in.cube` for basic validation studies ([Jiang et al.](https://doi.org/10.1016/j.ast.2022.108077)) - Use `in.general_surface` for simulations with uniform surface properties - Use `in.auto_surf_decomp` for complex geometries requiring automatic drag decomposition - Use `in.ampt_box` to modify the original test file for this toolbox ### Scripts That Depend on Input Files The following scripts explicitly use `in.ampt` as the input file: - **`run_sparta.sh`** - Runs `sparta -in in.ampt` - **`multi_altitude.py`** - Runs simulations using `in.ampt` - **`scripts/surface_temp_heatmap.py`** - Reads timestep from `in.ampt` - **`scripts/grid_temp_heatmap.py`** - Reads timestep from `in.ampt` - **`scripts/velocity_heatmap.py`** - Reads timestep from `in.ampt` - **`scripts/streamlines.py`** - Reads timestep from `in.ampt` - **`scripts/plot_drag.py`** - Reads domain size from `in.ampt` **To use these scripts:** Copy your desired input template to `in.ampt`: ```bash cp in.general_surface in.ampt # Example ``` ## 9. Drag Calculation Methods The `plot_drag.py` script computes drag using the direct sum method: ### Direct Sum Method Forces on each surface element are computed directly by SPARTA and summed: ``` compute surfF surf ampt atm fx fy fz fix surfavg ave/surf ampt 1 1 1 c_surfF[*] ave running compute drag reduce sum f_surfavg[1] ``` This method: - Sums x-component of forces imparted by gas molecules on all surface faces - Uses running time-average to reduce statistical noise - **Has been validated analytically and against existing literature to within ~0.2%** For the decomposed geometries (`in.cube` and `in.auto_surf_decomp`), drag is decomposed into: - **Ram drag:** Forces on x-normal faces (front/back) - **Skin friction:** Forces on y/z-normal faces (side walls) The alternate momentum flux method was removed in commit 21a5f46 due to issues with SPARTA's transparent surface flux measurement. ### Output Files Drag data is written to: - `dumps/direct_drag.dat` - Timestep, total drag, ram drag, skin friction ## 10. Best Practices ### Clearing Dump Files Between Runs **Important:** It is good practice to clear the `dumps/` folder between simulation runs: ```bash rm -f dumps/*.dat ``` The `in.ampt` files automatically do this at the start of each run (make sure you don't leave any important .dat files in there!): ``` shell "bash -c 'rm -f dumps/*.dat'" ``` However, if dumps are stored in subfolders (e.g., `dumps/alt_100km/`), they will persist. This is useful for keeping multi-altitude results. **Note:** Parquet files (`.parquet`) from previous conversions will also persist. Delete them manually if needed. These should probably be auto deleted during load_dumps.py, but it's late and I'm too tired to think about the implications of doing so. Submit a pull request if you have a problem with it, or anything else for that matter: ```bash rm -f dumps/*.parquet ``` ### Simulation Constraints SPARTA will output the required constraints at startup: ``` CELL SIZE MUST BE LESS THAN X m TIMESTEP MUST BE < Y s ``` Ensure your grid resolution and timestep satisfy these constraints for accurate results. My source for these values is a YouTube video that has been lost to the depths of my watch history. It has a guy lecturing over powerpoint slides and one of them has a cow in it. You can also just modulate these parameters until things converge, but this is a good starting point and to be honest I trust the guy with the microphone more. ### Memory Management - Use `tools/load_dumps.py` to convert raw dumps to Parquet before analysis - Analysis scripts use streaming access to avoid loading entire datasets into RAM - For very large simulations (>10GB dumps), consider analyzing one altitude at a time ### Validation Direct drag calculations have been validated: - Analytically against kinetic theory predictions - Against existing literature to within ~0.2% accuracy For critical applications, perform convergence studies by varying: - Grid resolution - Number of particles - Timestep size - Simulation duration ## 11. Running on Penn State Roar Supercomputer ![Penn State](https://icds.psu.edu/wp-content/uploads/2024/12/psu-mark-footer-1.png) For accuracy beyond the free molecular regime and into continuum, it is likely necessary to run the simulation on a cluster. These instructions are for Penn State's Roar cluster (Slurm scheduler). If you're SSH'd into the submit node, you can submit batch jobs to run SPARTA on compute nodes. ### Prerequisites Make sure SPARTA is built on Roar and the symlink is in place: ```bash ls -la sparta # should point to ../sparta/src/spa_mpi ``` Load the required modules (must match what SPARTA was compiled with): ```bash module load gcc/14.2.0 module load openmpi/4.1.1-pmi2 ``` ### Job Script A ready-to-use Slurm script is provided in `job_sparta.sh`: ```bash #!/bin/bash #SBATCH --job-name=sparta_ampt #SBATCH --account=open #SBATCH --partition=open #SBATCH --nodes=1 #SBATCH --ntasks=8 #SBATCH --time=04:00:00 #SBATCH --mem=16G #SBATCH --output=slurm_%j.out #SBATCH --error=slurm_%j.err module load gcc/14.2.0 module load openmpi/4.1.1-pmi2 cd $SLURM_SUBMIT_DIR # Run SPARTA with MPI mpirun -np $SLURM_NTASKS ./sparta -in in.ampt_box_Roar ``` Edit the script to change: - **Cores**: Change `--ntasks=8` (up to 48 per node) - **Wall time**: Change `--time=04:00:00` for longer simulations - **Memory**: Change `--mem=16G` if needed ### Available Partitions | Partition | Cores/node | RAM/node | Notes | |-----------|-----------|----------|-------| | **basic** | 64 | 247 GB | Most nodes, shortest queue wait | | **standard** | 48 | 375 GB | | | **himem** | 48 | ~1 TB (1021 GB) | Fewest nodes, longest queue wait | All partitions have a 14-day max wall time. Use `sinfo -p -o "%D %T" | grep idle` to check idle node count before submitting. ### Estimating Memory Requirements For a back-of-envelope estimate of how much RAM your job will need, use **152 bytes per cell** and **104 bytes per particle**, multiplied by a load factor of ~2 to cover ghost cells, MPI buffers, and SPARTA's internal bookkeeping: $$ \text{memory (GB)} = \frac{152 \cdot N_{\text{cells}} + 104 \cdot N_{\text{particles}}}{10^9} \cdot \text{load factor} $$ For example, a 250M-cell grid with 500M particles is (152·2.5×10⁸ + 104·5×10⁸) / 10⁹ · 2 ≈ **180 GB**, which is just above the ~132 GB observed on the basic-partition benchmark below (load factor for that run was closer to 1.5 because `gridcut 0.03` keeps ghost overhead modest). For the himem benchmark (960M cells, ~1.95B particles), the formula gives (152·9.6×10⁸ + 104·1.95×10⁹) / 10⁹ · 2 ≈ **697 GB**, which matches the ~674 GB observed (effective load factor ≈ 1.93, since `gridcut 0.01` plus billion-particle MPI buffers push ghost/communication overhead up to ~60%). Pick a partition whose RAM/node comfortably exceeds this estimate, or split across nodes. ### Estimating Credit Cost Roar provides several command-line utilities for credit accounting, including: - `get_balance` — shows your **current credit balance** across all accounts. - `job_estimate ` — predicts credits **before** submitting, from the resource requests in a Slurm script. - `credit_estimate -j ` — reports credits **actually consumed** by a completed job. Rough formula (Basic core-month ≈ 1 credit ≈ $2.96, prorated by runtime): $$ \text{credits} \approx \frac{N_{\text{cores}} \cdot t_{\text{hours}} \cdot \text{multiplier}}{720} $$ The multiplier depends on partition: Basic = 1, Standard = 1.99, High Memory = 2.78, plus higher rates for GPU partitions. The full credit/allocation pricing table is on the ICDS site: [Roar Credit Pricing](https://icds.psu.edu/services/roar/details-rates/) **[ICDS Roar — Service Details and Rates](https://icds.psu.edu/services/roar/details-rates/)** | Compute Type | Credit Multiplier | Allocation $/core/month | |---|---|---| | Basic | 1.00 | $2.96 | | Standard | 1.99 | $5.89 | | High Memory | 2.78 | $8.23 | | P100 GPU | 4.99 | $179.70* | | A100 (full) | 50.55 | $291.00* | | A100 (half) | 25.28 | $145.50* | | A100 MIG slice | 7.22 | — | | A40 GPU | 50.48 | — | | V100 GPU | 22.68 | — | A single credit costs **$2.96**. GPU allocation prices include the bundled Standard cores (28 for P100, 24 for full A100, 12 for half A100). Storage is billed separately (Active Group $3.09, Archive $1.34 per TB/month). See the [interactive cost estimator](https://icds.psu.edu/services/roar/details-rates/estimator/) for monthly budgeting. **Free credits through the READ program:** Every Roar account holder gets a baseline of **3 READ Credits per month** (use-or-lose, deposited in your personal `open` account; submit jobs with `--account=open` to spend them). That's enough for incidental testing but will not cover a production VLEO-DSMC run. For unfunded or under-funded research, Penn State faculty (PIs) can [**apply for additional subsidized READ "discovery" Credits**](https://icds.psu.edu/services/roar/read-credits/); these are sharable across a group and valid for up to a year. If you need more than that, or aren't eligible for READ, you can [**purchase Credits or an Allocation via iLab**](https://icds.psu.edu/services/roar/details-rates/). ### Benchmark (70 km altitude, basic partition) Tested on Roar `basic` partition (64 cores, 240 GB RAM requested): | Parameter | Value | |-----------|-------| | Grid | 1000 × 500 × 500 = **250M cells** | | Particles | Ns_target = 500M → **~498M created** | | `global gridcut` | 0.03 (required for large grids) | | Memory per proc | ~2.06 GB avg (2.06 GB × 64 procs = **~132 GB total**) | | Speed | **~1.0 s/timestep** (64 cores) | | 8000 steps | ~2.2 hours | **Important:** For large grids (>100M cells), you **must** use `global gridcut` and `block * * *` in `create_grid` to avoid each MPI rank storing all ghost cells (which will OOM). Example: ``` global gridcut 0.03 # each processor only stores ghost cells within 0.03 m of its own cells create_grid 1000 500 500 block * * * ``` Without these, a 250M-cell grid with 1.25B particles requires >1 TB of RAM. With them, the same grid fits in ~132 GB. ### Benchmark (70 km altitude, himem partition) Tested on Roar `himem` partition (48 cores, 950 GB RAM requested): | Parameter | Value | |-----------|-------| | Domain | 1.5 × 1.0 × 1.0 m (±0.75 x, ±0.5 y/z) | | Grid | 1500 × 800 × 800 = **960M cells** | | Particles | Ns_target = 2B → **~1.95B created** | | `global gridcut` | 0.01 (reduced from 0.03 to cut ghost overhead) | | `global mem/limit` | 1024 (splits MPI messages >1 GB to avoid 2 GB overflow) | | Ghost overhead | ~61.5% | | Memory per proc | ~14.0 GB avg (14.0 GB × 48 procs = **~674 GB total**) | | Speed | **~7.06 s/timestep** (48 cores) | | 8000 steps | ~15.7 hours | **Differences from Basic:** - Use `gridcut 0.01` (not 0.03) to reduce ghost cell overhead; each proc stores ghosts within 1 cm instead of 3 cm - Use `global mem/limit 1024` to prevent MPI send buffer overflow (>2 GB) for billion-cell grids - 2.78× credit multiplier vs basic. Monitor usage with `credit_estimate -j ` Example configuration: ``` global gridcut 0.01 global mem/limit 1024 create_grid 1500 800 800 block * * * ``` ### Submitting and Monitoring Jobs ```bash # Submit a job sbatch job_sparta.sh # Check job status squeue -u $USER # View full output less slurm_.out # Watch output in real time (while job is running) tail -f slurm_.out watch -n 1 squeue -u $USER # updates every 1 second # Cancel a job scancel # Check for errors after a job finishes (or crashes) cat slurm_.err # View past job details sacct -j --format=JobID,JobName,Elapsed,State,MaxRSS ``` ### Scratch Storage for Dump Files Roar home directory has a **16 GB quota**. Large simulations can produce dump files of 30+ GB per timestep, which will fill your home and cause jobs and file saves to fail. Use scratch storage instead by replacing the `dumps/` directory with a symlink: ```bash rm -rf dumps mkdir -p /scratch/$USER/VLEO-DSMC/dumps ln -s /scratch/$USER/VLEO-DSMC/dumps dumps ``` All scripts continue to work unchanged; they still write to `dumps/`, but the data goes to `/scratch` (50 TB quota). **Scratch files are purged after ~30 days of inactivity**, so copy important results back to home when done. Good luck - and most importantly, have fun! ## 12. DSMC vs. Theory agreement The purpose of this section is to test agreement between DSMC and theoretical drag calculations. This can be compared to the DSMC data from actual runs, along with simulation and computational requirements for detailed analysis. Run data is recorded in [`data/ampt_box_log.tsv`](data/ampt_box_log.tsv) and can be plotted against the `scripts/Ethan_drag_theory.py` file, which gives predicted drag for the 'ampt_box.surf' file using the same `nrlmsis_Ethan` data by combining free molecular and continuum drag equations. The recorded data is also presented below. The table is populated automatically by `tools/log_run.py`, which parses `log.sparta`, `in.ampt_box_Roar`, `job_sparta.sh`, and `sacct $SLURM_JOB_ID` (for credits) then appends a new row to the TSV and rewrites the markdown table between the `AMPT_BOX_LOG` markers below. Each run produces one new row so history is preserved. `job_sparta.sh` invokes the script automatically after `mpirun` finishes — Slurm sets `$SLURM_JOB_ID` in the job's environment so credits get filled in. Running it manually from a login shell still works (TSV/README get updated), but the credits column will be blank since there's no associated job. To refresh manually: ```bash python tools/log_run.py ``` | altitude | drag | c_d | cell size (req/actual) | timestep (req/actual) | cells | particles | ppc | partition | cores | speed/step | total steps | runtime | memory | credits used | |---|---|---|---|---|---|---|---|---|---|---|---|---|---|---| | 300 | 0.000169 | 2.36 | 439/0.01 | 0.145/1e-07 | 150x100x100 | 10M | 6.67 | himem | 48 | 28.82 ms | 8000 | 00:03:53 | | 0.009 | | 200 | 0.00132 | 2.54 | 61.5/0.01 | 0.0223/1e-07 | 150x100x100 | 10M | 6.67 | himem | 48 | 23.05 ms | 5000 | 00:01:58 | | 0.003085 | | 100 | 1.89 | 2.65 | 0.0455/0.01 | 4.15e-05/1e-07 | 150x100x100 | 10M | 6.67 | himem | 48 | 19.77 ms | 5000 | 00:01:41 | | 0.003085 | | 80 | 53.2 | 2.39 | 0.00146/0.01 | 1.26e-06/1e-07 | 150x100x100 | 10M | 6.67 | himem | 48 | 24.18 ms | 5000 | 00:02:03 | | 0.006170 | | 120 | 0.0709 | 2.53 | 1.15/0.01 | 0.000724/1e-07 | 150x100x100 | 10M | 6.67 | himem | 48 | 21.51 ms | 5000 | 00:01:50 | | 0.003085 | | 100 | 1.81 | 2.54 | 0.0455/0.001667 | 4.15e-05/1e-07 | 900x600x600 | 2000M | 6.17 | himem | 48 | 8070.25 ms | 1000 | 02:15:21 | | 0.416506 | | 100 | 1.81 | 2.54 | 0.0455/0.001667 | 4.15e-05/1e-07 | 900x600x600 | 2000M | 6.17 | himem | 48 | 8163.79 ms | 1000 | 02:16:57 | | 0.419591 | | 90 | 11.3 | 2.58 | 0.00741/0.0015 | 6.71e-06/1e-07 | 1000x667x667 | 2000M | 4.50 | himem | 48 | 9814 ms | 3600/10000 | 10:00:25 | 451 GB | 1.851138 |

_{DSMC results overlaid on FMF and continuum theoretical drag predictions (from scripts/Ethan_drag_theory.py).}