Autonomous Robotics Intelligence

Intelligent robots that learn, adapt, and perform.

We engineer reinforcement learning systems for autonomous robots — quadrupeds, manipulators, drones, and humanoids. From high-fidelity simulation to validated hardware deployment.

View Research Platforms Portal

01 — Platforms

Multi-platform robotics

Our RL pipelines deploy across every major robot form factor — ground, air, and bipedal systems.

PLATFORM.01

Quadrupeds

Dynamic locomotion, manipulation, and throwing policies. Phase-controlled gaits with sim-to-real transfer.

legged_locomotion // arm_manipulation

PLATFORM.02

Aerial Systems

Autonomous navigation, obstacle avoidance, and precision landing for multi-rotor platforms.

px4_autopilot // nav_avoidance

PLATFORM.03

Humanoids

Whole-body control, bipedal locomotion, and dexterous manipulation through learned policies.

bipedal_control // dexterous_manip

02 — Research

Active projects

End-to-end systems from reward design to real-world deployment.

Dynamic Arm Throwing

PPO policy for high-velocity throwing. Kinematic chain whip, phase clamping, curriculum 3m→10m, velocity auto-release, FK collision avoidance.

Isaac LabRSL-RLPPOSim2Real

12 m/s

Release Speed

AI Agent Autonomy Stack

Voice-commanded robot control with vision-language models. Person tracking, ASR, GPS, tactical bridge on edge compute.

ROS 2VLMYOLOTensorRT

Parameters

Persistent Object Tracking

Real-time multi-object tracking. 4-state machine, sparse optical flow, fisheye correction.

BoT-SORTOpenCVTensorRT

60 fps

Tracking

Edge LLM Inference

Multi-machine local model serving. Optimized quantized models for robot command interpretation.

llama.cppGGUFCUDA

125 GB

Unified Memory

GRPO Policy Optimization

Group Relative Policy Optimization. Parameter-efficient fine-tuning in under 15 minutes.

GRPOPEFTTRL

14 min

Training Time

03 — Capabilities

Core competencies

Policy Training

PPO, SAC with GPU-parallel environments in high-fidelity simulation.

Sim-to-Real Transfer

Domain randomization and system ID validated on hardware.

Computer Vision

Detection, tracking, and pose estimation for edge deploy.

Language Models

On-device LLM for robot command interpretation.

System Architecture

ROS 2, sensor fusion, behavior trees, distributed compute.

Performance Eng.

CUDA optimization, quantization, real-time tuning.

04 — Technology

Tech Stack

RL & Simulation

Isaac Lab / Sim
RSL-RL / PPO
PhysX GPU
MuJoCo
Gymnasium

Robotics

ROS 2 Humble
Robot SDKs
URDF / MJCF
Nav2 / SLAM
Behavior Trees

AI / ML

PyTorch
YOLO / TensorRT
llama.cpp
HuggingFace / PEFT
ASR / TTS

Infrastructure

CUDA 12 / 13
Docker
nginx / Tailscale
systemd
Ubuntu

05 — Hardware

Compute Fleet

Training Workstation

Desktop GPU, 16GB VRAM, 128GB system. Primary simulation training.

CUDA 12.8 // x86_64

Inference Server

125GB unified memory. Large model inference and fine-tuning.

CUDA 13.0 // ARM64

Edge Compute

64GB unified, ROS 2. Primary robot deployment platform.

ARM64 // Docker

Research Station

Desktop AI supercomputer. 3.67TB storage.

NVIDIA DGX