Gesture interaction for 3D creation and VR

Precision Gesture Interaction

Precise, stable, and built for real spatial work.

xtan is a gesture interaction framework that converts tracked motion into structured commands for professional 3D software. Instead of focusing on visual hand tracking alone, xtan emphasizes motion consistency, spatial structure, and timing. The goal is to provide reliable gesture input for real production workflows in 3D creation, VR, and spatial applications.

xtan gesture interaction for 3D authoring

A demo video will be available on YouTube soon

What’s different

Most hand-tracking systems are built for visual realism or casual interaction. They often rely on RGB cameras and AI inference to estimate a hand pose and guess a likely meaning. xtan takes a different approach by focusing on motion consistency, spatial structure, and timing.

Hello icon

Beyond visual pose guessing

Most AI vision systems can easily recognize a simple gesture like “hello”, but the underlying motion, timing, and intent may still remain unclear. xtan interprets gestures as motion patterns rather than static poses.

Spatial icon

Built on spatial tracking

Instead of relying purely on image-based estimation, xtan uses precise spatial tracking data to create stable, repeatable, and more reliable gesture input for professional tools and demanding interactive workflows.

Workflow icon

Designed for real workflows

The goal is not novelty interaction, but structured, reproducible input that fits demanding 3D and VR workflows, where stability, consistency, and reliable control are more important than visual effect.

Why use xtan

Many professionals spend hours working in 3D environments such as Blender or Unreal Engine. Traditional interaction often requires constant switching between tools, menus, and keyboard shortcuts. xtan introduces gesture-based interaction to reduce friction and streamline creative workflows.

Efficiency icon

More efficient

Perform actions with fewer steps, reduce repeated tool switching, and interact with tools more directly during modeling, editing, and spatial workflows.

Focus workflow icon

More focus

Reduce interruptions, avoid repeated context switching, and stay focused on the creative task instead of constantly navigating tools and interface layers.

Real-time interaction icon

Real-time and reproducible

Low latency interaction, predictable behavior, and undo-safe integrations help ensure gesture input remains stable, consistent, and reliable.

What can you do with xtan?

xtan is designed for creators and developers who work daily in complex 3D environments. It helps reduce workflow friction and introduces more direct spatial interaction for real production tasks.

Creator workflow icon

Built for creators

xtan supports creators who spend long hours inside demanding 3D workflows. It helps reduce repeated tool switching, streamline common actions, and keep attention on the actual creative task.

Natural interaction icon

Natural and fluid spatial interaction

xtan adds a spatial interaction layer that makes work in 3D space feel more direct and fluid, with less friction between real-world motion, user intention, and application response.

Tracked 3D pen icon

Simple tools, precise input

xtan can even turn a simple tracked pen into a precise spatial input tool. With a small built-in trigger button, for example via Bluetooth, lightweight tools can become stable and repeatable spatial controllers.

Additional integrations

xtan can also be used with other creative and engineering software, with more integrations possible over time.

FreeCAD icon
LibreCAD iconQCAD icon

Current interest

FreeCAD, LibreCAD, and QCAD are natural candidates for gesture-assisted workflows.

SteamVR icon
SolidWorks iconAdobe icon

Potential future targets

SteamVR, SolidWorks, Adobe, and Godot are also possible directions for future integration.

Open source icon
Modular iconArchitecture icon

Open architecture

The system is intended to remain modular, so integrations can grow alongside the ecosystem.

Technologies we love

xtan is built on a modern stack focused on performance, openness, and portability. These technologies enable reliable spatial tracking, efficient processing, and flexible deployment across different hardware platforms.

Programming languages

The core implementation focuses on efficient systems programming and performance-critical components. High-level languages are used where flexibility and rapid development are beneficial.

Hardware support

xtan runs on standard compute hardware. Tracking can be performed on edge devices such as Raspberry Pi 5, while processing is handled on a host system.

For best performance, the recommended architecture is EdgeTrack — a modular open-source stereo tracking system designed specifically for precise spatial motion capture.

How it works

Tracking → Fusion → Gesture Interpretation → Application Command

Stereo tracking camera

EdgeTrack

Captures precise stereo tracking data and extracts spatial motion information directly from synchronized camera views.

Multi-view fusion

CoreFusion

Combines multiple camera views and improves tracking stability through spatial fusion and temporal consistency.

Gesture mapping

MotionCoder

Interprets motion patterns as structured gestures and maps them to commands for 3D software and spatial applications.

See the full architecture diagrams and pipeline variants in the docs.

Community & updates

Join the xtan community and receive updates about development progress, releases, and hardware availability.

Access community and documentation updates with a simple email login.

No spam. Magic-link login. Unsubscribe anytime.

Logo inspiration

Originally, I wanted a short and memorable .ai domain, ideally with only four letters. Since most four-letter .ai domains were already registered, I chose xtan.ai when I found it still available.

The logo is based on a simple mathematical construction: a straight line and a tangent curve that together form a stylized X with an inner flowing structure.

Instead of being manually illustrated, the shape was generated programmatically with CSS and JavaScript. The concept can be visualized by plotting y = 1.5x and y = tan(-0.5x) over the interval -π < x < π.

Note

This section is still new and currently under development. I will continue refining it step by step. Thank you for your understanding and patience.