| NOTHING | Do nothing this tick |
| CLAIM | Claim an adjacent unclaimed tile |
| DIG_OWN | Dig an owned mountain tile deeper |
| DIG_ADJ | Dig an adjacent mountain tile |
| SOLAR | Build Solar Panel (☀️ energy producer) |
| WIND | Build Wind Turbine (💨 constant energy) |
| BATTERY | Build Battery (🔋 energy storage) |
| FACTORY | Build Factory (🏭 converts energy → gold) |
| DRILL | Build Drill Silo (🔨 digs remote tiles) |
| MISSILE | Build Missile Silo (🚀 damages enemy buildings) |
| SHIELD | Build Shield (🛡️ protects tile from missiles) |
| UPGRADE | Upgrade best building (higher level = more output) |
| FIRE_DRILL | Fire a drill at a remote tile |
| FIRE_MISS | Fire a missile at enemy tile |
| 14-17 | ALLOC actions DISABLED; allocation fixed at [1,1,1] (balanced) |
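
The action table above can be sketched as an enum plus a mask that hides the disabled ALLOC slots. This is a minimal illustration, not the project's actual code: the 0-13 numbering is inferred from the row order and from ALLOC occupying IDs 14-17, and the names `Action`, `action_mask`, `NUM_ACTION_IDS`, and `FIXED_ALLOCATION` are hypothetical.

```python
from enum import IntEnum


class Action(IntEnum):
    """Discrete actions. IDs 0-13 are an assumption based on table order."""
    NOTHING = 0
    CLAIM = 1
    DIG_OWN = 2
    DIG_ADJ = 3
    SOLAR = 4
    WIND = 5
    BATTERY = 6
    FACTORY = 7
    DRILL = 8
    MISSILE = 9
    SHIELD = 10
    UPGRADE = 11
    FIRE_DRILL = 12
    FIRE_MISS = 13


# IDs 14-17 are the ALLOC actions, which the table marks as disabled.
DISABLED_ALLOC_IDS = range(14, 18)
NUM_ACTION_IDS = 18

# With ALLOC disabled, the allocation stays at the balanced default.
FIXED_ALLOCATION = (1, 1, 1)

# Actions that place a building (hypothetical grouping for illustration).
BUILD_ACTIONS = frozenset({
    Action.SOLAR, Action.WIND, Action.BATTERY, Action.FACTORY,
    Action.DRILL, Action.MISSILE, Action.SHIELD,
})


def action_mask():
    """Return one boolean per action ID; False marks a disabled ID."""
    return [i not in DISABLED_ALLOC_IDS for i in range(NUM_ACTION_IDS)]
```

A policy could multiply its action scores by this mask (or set masked scores to negative infinity) so that IDs 14-17 are never sampled.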

| Metric | Description |
|---|---|
| Episode Reward | Total reward accumulated during episode |
| Training Loss | Training loss (lower = more stable learning) |
| Gold | Gold at end of episode (🪙) |
| Territory | Tiles owned at end of episode |
| Buildings | Total buildings placed |
| Epsilon | Exploration rate (1.0=random → 0.05=greedy) |
| Missile Rate | % of recent episodes with a missile silo built |
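
Two of the metrics above, Epsilon and Missile Rate, are computed values rather than raw counters, so a small sketch may help. The table only gives the epsilon endpoints (1.0 and 0.05) and says "recent episodes"; the linear decay shape, the `decay_episodes` horizon, and the window size of 100 are assumptions made here for illustration, and `epsilon_at` and `MissileRate` are hypothetical names.

```python
from collections import deque

EPS_START = 1.0   # from the table: fully random
EPS_END = 0.05    # from the table: near-greedy


def epsilon_at(episode, decay_episodes=1000):
    """Linearly interpolate from EPS_START to EPS_END (assumed schedule)."""
    frac = min(max(episode / decay_episodes, 0.0), 1.0)
    return EPS_START + frac * (EPS_END - EPS_START)


class MissileRate:
    """Percent of the last `window` episodes in which a missile silo was built."""

    def __init__(self, window=100):
        # deque with maxlen drops the oldest episode automatically.
        self.flags = deque(maxlen=window)

    def record(self, built_missile_silo):
        self.flags.append(bool(built_missile_silo))

    def percent(self):
        if not self.flags:
            return 0.0
        return 100.0 * sum(self.flags) / len(self.flags)
```

If the real schedule is exponential or step-based, only `epsilon_at` would need to change; the endpoints stay as listed in the table.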

| Reward | Event |
|---|---|
| -0.3 | DO_NOTHING (flat penalty) |
| +1.0 | CLAIM tile initiated |
| +0.5 | DIG (own or adjacent) |
| +5.0 | BUILD any building |
| +5.0 | UPGRADE building |
| +2.0 | FIRE_DRILL |
| +10 | Missile hits enemy building |
| +20 | Missile destroys enemy building |
| +2.0/tile | Territory gained (continuous) |
| +0.05/gold | Gold production (continuous) |
| +500 / -500 | Victory / Defeat |
| ⚙️ | Max 2 concurrent constructions |
| ⚙️ | Episodes: 1500 ticks |
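
To show how the reward rows above might combine in a single tick, here is a rough sketch. The numeric values come from the table; everything else is an assumption: the `TickOutcome` container, the function names, treating a destroying missile as +20 only (not +10 and +20 stacked), and giving unlisted actions such as FIRE_MISS no direct action reward.

```python
from dataclasses import dataclass
from typing import Optional

# Per-action rewards, taken from the table.
ACTION_REWARDS = {
    "DO_NOTHING": -0.3,
    "CLAIM": 1.0,
    "DIG": 0.5,        # own or adjacent
    "BUILD": 5.0,      # any building
    "UPGRADE": 5.0,
    "FIRE_DRILL": 2.0,
}
MISSILE_HIT = 10.0
MISSILE_DESTROY = 20.0
PER_TILE = 2.0
PER_GOLD = 0.05
TERMINAL = 500.0

# Constraints listed alongside the rewards.
MAX_CONCURRENT_CONSTRUCTIONS = 2
EPISODE_TICKS = 1500


@dataclass
class TickOutcome:
    """Hypothetical summary of what happened during one tick."""
    action_kind: str
    tiles_gained: int = 0
    gold_produced: float = 0.0
    missile_hits: int = 0       # hits that did not destroy the target
    missile_kills: int = 0      # hits that destroyed the target
    result: Optional[str] = None  # "victory", "defeat", or None


def tick_reward(o):
    """Sum the action, event, continuous, and terminal reward terms."""
    r = ACTION_REWARDS.get(o.action_kind, 0.0)  # unlisted actions earn 0
    r += MISSILE_HIT * o.missile_hits + MISSILE_DESTROY * o.missile_kills
    r += PER_TILE * o.tiles_gained + PER_GOLD * o.gold_produced
    if o.result == "victory":
        r += TERMINAL
    elif o.result == "defeat":
        r -= TERMINAL
    return r


def can_start_construction(active_constructions):
    """Enforce the cap of two concurrent constructions."""
    return active_constructions < MAX_CONCURRENT_CONSTRUCTIONS
```

For example, a tick that builds something, gains one tile, and produces 10 gold would score 5.0 + 2.0 + 0.5 under these assumptions. Note how small the per-tick terms are next to the terminal value of 500 over a 1500-tick episode.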