🛠️ Steven Gong

May 05, 2025, 1 min read

OpenVLA-OFT

Paper that introduces an Optimized Fine-Tuning (OFT) recipe for OpenVLA.

Links

  • https://openvla-oft.github.io/
  • https://arxiv.org/html/2502.19645v1
  • https://github.com/moojink/openvla-oft

Architecture

  • Llama 2 as the language backbone (why 2?)
  • FiLM

Two main contributions:

  1. Parallel decoding of actions
  2. FiLM (Feature-wise Linear Modulation) for better adherence to language instructions
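
To make the FiLM idea concrete, here is a minimal sketch in plain Python. It assumes the conditioning signal is a single language embedding vector, and that one scale (gamma) and one shift (beta) per feature channel are shared across all visual patches. The names `linear`, `film`, and the toy weights are hypothetical and are not taken from the OpenVLA-OFT codebase.

```python
def linear(vec, weights, bias):
    """Affine map; `weights` holds one row per output feature."""
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, bias)]


def film(patch_features, lang_embedding, w_gamma, b_gamma, w_beta, b_beta):
    """Feature-wise Linear Modulation.

    Predicts a per-channel scale (gamma) and shift (beta) from the
    conditioning vector, then applies gamma * x + beta to every patch.
    """
    gamma = linear(lang_embedding, w_gamma, b_gamma)
    beta = linear(lang_embedding, w_beta, b_beta)
    return [[g * x + b for x, g, b in zip(patch, gamma, beta)]
            for patch in patch_features]


# Toy example: 2 visual patches with 3 channels, 2-dim language embedding.
patches = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
lang = [1.0, -1.0]
w_gamma = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # hypothetical weights
b_gamma = [0.0, 0.0, 1.0]
w_beta = [[0.5, 0.0], [0.0, 0.5], [0.0, 0.0]]   # hypothetical weights
b_beta = [0.0, 0.0, 0.0]

modulated = film(patches, lang, w_gamma, b_gamma, w_beta, b_beta)
print(modulated)
```

Because gamma and beta depend only on the instruction, a different instruction rescales the same visual features differently, which is one plausible reading of why this helps the policy pay attention to language.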

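Parallel decoding predicts all action tokens in a single forward pass instead of generating them one at a time, which seems like an interesting way to increase inference speed.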
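
The speed difference comes down to the number of forward passes. The toy sketch below only counts calls. `ToyPolicy` is a hypothetical stand-in for the model, its outputs are fake, and `ACTION_DIM` and `CHUNK_LEN` are assumed values chosen for illustration.

```python
ACTION_DIM = 7  # assumed: one value per degree of freedom
CHUNK_LEN = 8   # assumed: future timesteps predicted per query


class ToyPolicy:
    """Hypothetical stand-in for the backbone; counts forward passes."""

    def __init__(self):
        self.forward_calls = 0

    def forward(self, context, num_outputs):
        self.forward_calls += 1
        # Fake deterministic outputs; a real model would run a transformer.
        return [((len(context) + i) % 10) / 10.0 for i in range(num_outputs)]


def decode_autoregressive(policy, observation, n_tokens):
    """One forward pass per token; each output is fed back as input."""
    context = list(observation)
    actions = []
    for _ in range(n_tokens):
        token = policy.forward(context, 1)[0]
        actions.append(token)
        context.append(token)
    return actions


def decode_parallel(policy, observation, n_tokens):
    """Single forward pass: placeholder inputs stand in for every action
    slot, and all outputs are read off at once."""
    placeholders = [0.0] * n_tokens
    return policy.forward(list(observation) + placeholders, n_tokens)


n_tokens = ACTION_DIM * CHUNK_LEN
obs = [0.1, 0.2, 0.3]

ar_policy, par_policy = ToyPolicy(), ToyPolicy()
ar_actions = decode_autoregressive(ar_policy, obs, n_tokens)
par_actions = decode_parallel(par_policy, obs, n_tokens)
print(ar_policy.forward_calls, par_policy.forward_calls)
```

The toy does not claim the two schemes produce the same actions. It only shows that the sequential loop needs one pass per token while the parallel version needs one pass per chunk.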

Has some really good visualizations of different tasks and how they fail.
