Video Demos

Demo 1 (×5 speed)

Demo 2 (×5 speed)

Demo 3 (×5 speed)

Demo 4 (×5 speed)

Demo 5 (×5 speed)

All videos are shown at ×5 speed unless otherwise stated.

PalletFit: Stability-Aware Online 3D Bin Packing via Transformer-Based Reinforcement Learning

Jinkyo Chung¹, BongKyu Jeon¹, Ju-Hwan Seo²
¹AI Laboratory, Uon Robotics, Daejeon, Republic of Korea · ²CJ Logistics, Seoul, Republic of Korea
Emails: jinkyo.chung@gmail.com, bongkyu.jun@uonrobotics.com, seojh1989@gmail.com

Links

Code (Coming Soon) BibTeX

TL;DR

Online stability-aware 3D bin packing for variable-sized items: EDP candidate generation → Transformer RL joint item+placement selection under feasibility constraints → Jam-Motion execution for robust real-world packing.

EDP candidates Feasibility constraints FT-Transformer RL Jam-Motion

Abstract

Industrial palletizing increasingly requires stable 3D bin packing of items with diverse sizes. However, prior studies often restrict size variability and overlook physical stability, resulting in tilted or toppled placements and product damage. We introduce PalletFit, an online stability-aware 3D bin packing framework designed for variable-sized items, unifying planning and execution through the Generation, Selection, and Execution stages. In the Generation stage, we propose Edge-Projection (EDP), which discretizes the continuous placement space into a compact discrete candidate set using lightweight post-processing. In the Selection stage, a Feature-Tokenizer Transformer-based Reinforcement Learning (RL) agent jointly selects items and placements under strict heuristic feasibility constraints to ensure physical realizability. In the Execution stage, we employ a robot motion primitive, Jam-Motion, to compensate for sensing and control errors through final contact alignment before release. Experimental results demonstrate that PalletFit markedly outperforms existing baselines, achieving dense and physically stable packing. Real-world experiments on an industrial robot further confirm robust execution with no observed stability violations.

Videos are publicly available above. Code will be released upon paper acceptance.

Method Overview

Generation: Edge-Projection (EDP) turns continuous placement space into a compact candidate set.
Selection: Feature-Tokenizer Transformer RL jointly selects item + placement with action masking under feasibility constraints.
Execution: Jam-Motion aligns by contact to absorb sensing/control errors.

Reproducibility

Placeholder commands (update later when you release code).

# Coming soon
# We will provide installation and evaluation instructions upon release.

Code Availability

The code will be released publicly upon paper acceptance.

Citation

@article{chung2026palletfit,
  title   = {PalletFit: Stability-Aware Online 3D Bin Packing via Transformer-Based Reinforcement Learning},
  author  = {Chung, Jinkyo and Jeon, BongKyu and Seo, Ju-Hwan},
  journal = {arXiv preprint},
  year    = {2026}
}

Update the venue/arXiv info when finalized.