pose_detection 3.0.2

Pose, person and landmark detection using on-device TFLite models.

pose_detection


Flutter plugin for on-device, multi-person pose detection and landmark estimation using TensorFlow Lite. Uses YOLOv8n for person detection and Google's BlazePose for 33-keypoint landmark extraction.

[Example Screenshot]

Quick Start #

import 'dart:io';
import 'dart:typed_data';
import 'package:pose_detection/pose_detection.dart';

Future<void> main() async {
  // One-step construction and initialization
  final PoseDetector detector = await PoseDetector.create(
    mode: PoseMode.boxesAndLandmarks,
    landmarkModel: PoseLandmarkModel.heavy,
  );

  // Load and detect from image bytes
  final Uint8List imageBytes = await File('image.jpg').readAsBytes();
  final List<Pose> results = await detector.detect(imageBytes);

  // Access results
  for (final Pose pose in results) {
    final BoundingBox bbox = pose.boundingBox;
    print('Bounding box: (${bbox.left}, ${bbox.top}) → (${bbox.right}, ${bbox.bottom})');
    print('Size: ${bbox.width} x ${bbox.height}, center: (${bbox.center.x}, ${bbox.center.y})');

    if (pose.hasLandmarks) {
      // Iterate over landmarks
      for (final PoseLandmark lm in pose.landmarks) {
        print('${lm.type}: (${lm.x.toStringAsFixed(1)}, ${lm.y.toStringAsFixed(1)}) vis=${lm.visibility.toStringAsFixed(2)}');
      }

      // Access landmarks individually
      // See "Pose Landmark Types" section in README for full list of landmarks
      final PoseLandmark? leftKnee = pose.getLandmark(PoseLandmarkType.leftKnee);
      if (leftKnee != null) {
        print('Left knee visibility: ${leftKnee.visibility.toStringAsFixed(2)}');
      }
    }
  }

  // Clean up
  await detector.dispose();
}

Alternatively, construct and initialize separately if you need to configure between steps:

final PoseDetector detector = PoseDetector(
  mode: PoseMode.boxesAndLandmarks,
  landmarkModel: PoseLandmarkModel.heavy,
);
await detector.initialize();

Refer to the sample code on the pub.dev example tab for a more in-depth example.

Migration Guide #

3.0.0: detectFromMat signature change #

The imageWidth and imageHeight named arguments have been removed. Dimensions are now read directly from the Mat.

// Before (2.x)
final poses = await detector.detectFromMat(
  mat,
  imageWidth: mat.cols,
  imageHeight: mat.rows,
);

// After (3.0)
final poses = await detector.detectFromMat(mat);

Web (Flutter Web) #

This package supports Flutter Web using the same package import:

import 'package:pose_detection/pose_detection.dart';

Two web runtimes are available, selectable per PoseDetector:

  1. Default — tflite-js (CPU/WASM). No setup beyond initialization; the same path that shipped in older versions.
  2. LiteRT.js with WebGPU delegate. Google's official web runtime via flutter_litert ≥ 2.5.0. ~18× faster than the default in real measurements (446 ms → 25 ms per call on the heavy BlazePose model with mixed single/multi-person images). Auto-loaded from a CDN by default; it can instead be self-hosted via a small loader script in web/index.html.

The main difference from native is how you load images:

  • The Quick Start example above uses dart:io (File(...)), which is not available on web.
  • On web, load an image as Uint8List (for example from a file picker, drag-and-drop, or network response) and call detect(imageBytes).
  • detectFromMat(...) (OpenCV cv.Mat) is native-only and is not available on web.
  • interpreterPoolSize, performanceConfig, and useNativePreprocessing are accepted for API compatibility but are ignored on web.

Beyond image loading, usage on web is identical to native:

final detector = await PoseDetector.create(
  mode: PoseMode.boxesAndLandmarks,
  landmarkModel: PoseLandmarkModel.heavy,
);

final List<Pose> poses = await detector.detect(imageBytes);

await detector.dispose();
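On web, the bytes can come from any source that yields a Uint8List. A minimal sketch using the file_picker package (an assumption for illustration; this package only requires the bytes, not any particular picker):

```dart
import 'dart:typed_data';

import 'package:file_picker/file_picker.dart';
import 'package:pose_detection/pose_detection.dart';

Future<void> detectFromPicker(PoseDetector detector) async {
  // withData: true asks the picker to return file contents as bytes,
  // which is required on web where there is no file system path.
  final FilePickerResult? result = await FilePicker.platform.pickFiles(
    type: FileType.image,
    withData: true,
  );

  final Uint8List? bytes = result?.files.single.bytes;
  if (bytes == null) return; // user cancelled the picker

  final List<Pose> poses = await detector.detect(bytes);
  print('Detected ${poses.length} people');
}
```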

To use the LiteRT.js runtime, opt in at construction time. The runtime is auto-loaded from a CDN on first use, so no web/index.html changes are required:

final detector = await PoseDetector.create(
  mode: PoseMode.boxesAndLandmarks,
  landmarkModel: PoseLandmarkModel.heavy,
  useLiteRt: true,
  liteRtAccelerator: 'webgpu', // 'wasm' on browsers without WebGPU
);

If WebGPU compilation fails for any op, the runtime automatically falls back to wasm (still substantially faster than the default tflite-js path, because LiteRT.js's WASM build is SIMD-optimized).

If you need to self-host the runtime (offline, strict CSP, or to pin a specific build), call flutter_litert's configureLiteRtLoader(moduleUrl: ..., wasmUrl: ...) before any PoseDetector.create, or set autoLoad: false and load it from your own <script> tag instead.
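A self-hosting sketch based on the options described above (the asset paths are illustrative; the exact configureLiteRtLoader signature is flutter_litert's, so check its documentation):

```dart
import 'package:flutter_litert/flutter_litert.dart';
import 'package:pose_detection/pose_detection.dart';

Future<PoseDetector> createSelfHosted() async {
  // Point the loader at runtime assets you serve yourself.
  // Must run before the first PoseDetector.create.
  configureLiteRtLoader(
    moduleUrl: '/litert/litert.js',
    wasmUrl: '/litert/litert.wasm',
  );

  return PoseDetector.create(
    mode: PoseMode.boxesAndLandmarks,
    landmarkModel: PoseLandmarkModel.heavy,
    useLiteRt: true,
    liteRtAccelerator: 'webgpu',
  );
}
```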

Benchmarks #

Heavy BlazePose model on macOS Chrome 147, 5 images, 10 timed iterations each, averaged over 2 runs (see runWebBenchmark.sh):

| Image | Detections | Default (tflite-js) | LiteRT.js webgpu | Speedup |
|-------|------------|---------------------|------------------|---------|
| pose1 | 1          | 357 ms              | 20 ms            | 17.8×   |
| pose2 | 1          | 357 ms              | 18 ms            | 19.9×   |
| pose3 | 2          | 430 ms              | 23 ms            | 18.7×   |
| pose4 | 6          | 726 ms              | 46 ms            | 15.9×   |
| pose5 | 1          | 360 ms              | 17 ms            | 20.7×   |
| mean  |            | 446 ms              | 25 ms            | ~18×    |

Detection counts are identical between the two runtimes on every image.

Separate example_web app #

The repository keeps the browser demo in example_web/ (separate from example/) because the web sample uses browser-specific APIs (HTML file picker + canvas overlay) and UI flow. The demo is wired with useLiteRt: true, liteRtAccelerator: 'webgpu' and the loader snippet in web/index.html — copy from there as a starting point.

Run the web demo locally:

cd example_web
flutter pub get
flutter run -d chrome

Build for web:

cd example_web
flutter build web

Performance #

Hardware Acceleration #

The package automatically selects the best acceleration strategy for each platform:

| Platform | Default Delegate | Speedup | Notes |
|----------|------------------|---------|-------|
| macOS    | XNNPACK          | 2-5x    | SIMD vectorization (NEON on ARM, AVX on x86) |
| Linux    | XNNPACK          | 2-5x    | SIMD vectorization |
| iOS      | Metal GPU        | 2-4x    | Hardware GPU acceleration |
| Android  | XNNPACK          | 2-5x    | ARM NEON SIMD acceleration |
| Windows  | XNNPACK          | 2-5x    | SIMD vectorization (AVX on x86) |

No configuration is needed: just call initialize() and you get optimal performance for your platform.

Advanced Performance Configuration #

// Auto mode (default), optimal for each platform
await detector.initialize();

// Force XNNPACK (all native platforms)
final detector = PoseDetector(
  performanceConfig: PerformanceConfig.xnnpack(numThreads: 4),
);
await detector.initialize();

// Force GPU delegate (iOS recommended, Android experimental)
final detector = PoseDetector(
  performanceConfig: PerformanceConfig.gpu(),
);
await detector.initialize();

// CPU-only (maximum compatibility)
final detector = PoseDetector(
  performanceConfig: PerformanceConfig.disabled,
);
await detector.initialize();

Pose Detection Modes #

This package supports two operation modes that determine what data is returned:

| Mode | Description | Output |
|------|-------------|--------|
| boxesAndLandmarks (default) | Full two-stage detection (YOLO + BlazePose) | Bounding boxes + 33 landmarks |
| boxes | Fast YOLO-only detection | Bounding boxes only |

Use boxes-only mode for faster detection #

When you only need to detect where people are (without body landmarks), use PoseMode.boxes for better performance:

final PoseDetector detector = PoseDetector(
  mode: PoseMode.boxes,  // Skip landmark detection
);
await detector.initialize();

final List<Pose> results = await detector.detect(imageBytes);
for (final Pose pose in results) {
  print('Person detected at: ${pose.boundingBox}');
  print('Detection confidence: ${pose.score.toStringAsFixed(2)}');
  // pose.hasLandmarks will be false
}

Bounding Boxes #

The boundingBox property returns a BoundingBox object representing the pose bounding box in absolute pixel coordinates. The BoundingBox provides convenient access to corner points, dimensions (width and height), and the center point.

Accessing Corners #

final BoundingBox boundingBox = pose.boundingBox;

// Access individual corners by name (each is a Point with x and y)
final Point topLeft     = boundingBox.topLeft;       // Top-left corner
final Point topRight    = boundingBox.topRight;      // Top-right corner
final Point bottomRight = boundingBox.bottomRight;   // Bottom-right corner
final Point bottomLeft  = boundingBox.bottomLeft;    // Bottom-left corner

// Access coordinates
print('Top-left: (${topLeft.x}, ${topLeft.y})');

Additional Bounding Box Parameters #

final BoundingBox boundingBox = pose.boundingBox;

// Access dimensions and center
final double width  = boundingBox.width;     // Width in pixels
final double height = boundingBox.height;    // Height in pixels
final Point center = boundingBox.center;  // Center point

// Access coordinates
print('Size: ${width} x ${height}');
print('Center: (${center.x}, ${center.y})');

// Access all corners as a list (order: top-left, top-right, bottom-right, bottom-left)
final List<Point> allCorners = boundingBox.corners;
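
For drawing, a BoundingBox maps directly onto Flutter's Rect (a small sketch; the toRect helper below is not part of this package):

```dart
import 'dart:ui';

import 'package:pose_detection/pose_detection.dart';

// Convert a BoundingBox to a dart:ui Rect, e.g. for Canvas.drawRect
// in a CustomPainter overlay.
Rect toRect(BoundingBox box) =>
    Rect.fromLTRB(box.left, box.top, box.right, box.bottom);
```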

Pose Landmark Models #

Choose the model that fits your performance needs:

| Model | Speed    | Accuracy |
|-------|----------|----------|
| lite  | Fastest  | Good     |
| full  | Balanced | Better   |
| heavy | Slowest  | Best     |

Pose Landmark Types #

Every pose contains up to 33 landmarks that align with the BlazePose specification:

  • nose
  • leftEyeInner
  • leftEye
  • leftEyeOuter
  • rightEyeInner
  • rightEye
  • rightEyeOuter
  • leftEar
  • rightEar
  • mouthLeft
  • mouthRight
  • leftShoulder
  • rightShoulder
  • leftElbow
  • rightElbow
  • leftWrist
  • rightWrist
  • leftPinky
  • rightPinky
  • leftIndex
  • rightIndex
  • leftThumb
  • rightThumb
  • leftHip
  • rightHip
  • leftKnee
  • rightKnee
  • leftAnkle
  • rightAnkle
  • leftHeel
  • rightHeel
  • leftFootIndex
  • rightFootIndex
// Example: how to access specific landmarks
// PoseLandmarkType can be any of the 33 landmarks listed above.
final PoseLandmark? leftHip = pose.getLandmark(PoseLandmarkType.leftHip);
if (leftHip != null && leftHip.visibility > 0.5) {
  // Pixel coordinates in original image space
  print('Left hip position: (${leftHip.x}, ${leftHip.y})');

  // Depth information (relative z-coordinate)
  print('Left hip depth: ${leftHip.z}');
}
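
Because landmarks are plain pixel coordinates, downstream geometry is ordinary math. As an illustration (the jointAngle helper below is not part of this package), a knee flexion angle can be computed from the hip, knee, and ankle positions:

```dart
import 'dart:math' as math;

/// Angle at vertex B (in degrees) formed by points A-B-C.
/// Feed it landmark coordinates, e.g. leftHip, leftKnee, leftAnkle.
double jointAngle(double ax, double ay, double bx, double by,
    double cx, double cy) {
  final double abx = ax - bx, aby = ay - by; // vector B -> A
  final double cbx = cx - bx, cby = cy - by; // vector B -> C
  final double dot = abx * cbx + aby * cby;
  final double cross = abx * cby - aby * cbx;
  return math.atan2(cross.abs(), dot) * 180 / math.pi;
}

// Example: hip at (0, 0), knee at (0, 100), ankle at (100, 100)
// jointAngle(0, 0, 0, 100, 100, 100) ≈ 90 (a right angle at the knee)
```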

Drawing Skeleton Connections #

The package provides poseLandmarkConnections, a predefined list of landmark pairs that form the body skeleton. Use this to draw skeleton overlays:

import 'package:flutter/material.dart';
import 'package:pose_detection/pose_detection.dart';

class PoseOverlayPainter extends CustomPainter {
  final Pose pose;

  PoseOverlayPainter(this.pose);

  @override
  void paint(Canvas canvas, Size size) {
    final Paint paint = Paint()
      ..color = Colors.green
      ..strokeWidth = 3
      ..strokeCap = StrokeCap.round;

    // Draw all skeleton connections
    for (final connection in poseLandmarkConnections) {
      final PoseLandmark? start = pose.getLandmark(connection[0]);
      final PoseLandmark? end = pose.getLandmark(connection[1]);

      // Only draw if both landmarks are visible
      if (start != null && end != null &&
          start.visibility > 0.5 && end.visibility > 0.5) {
        canvas.drawLine(
          Offset(start.x, start.y),
          Offset(end.x, end.y),
          paint,
        );
      }
    }

    // Draw landmark points
    for (final landmark in pose.landmarks) {
      if (landmark.visibility > 0.5) {
        canvas.drawCircle(
          Offset(landmark.x, landmark.y),
          5,
          Paint()..color = Colors.red,
        );
      }
    }
  }

  @override
  bool shouldRepaint(covariant CustomPainter oldDelegate) => true;
}

The poseLandmarkConnections constant contains 27 connections organized by body region:

  • Face: Eyes to nose, eyes to ears, mouth
  • Torso: Shoulders and hips forming the core
  • Arms: Shoulders → elbows → wrists → fingers (left and right)
  • Legs: Hips → knees → ankles → feet (left and right)

Live Camera Detection #

For real-time pose detection with a camera feed, use detectFromCameraImage. It auto-detects YUV420 (NV12 / NV21 / I420) and desktop BGRA/RGBA layouts, and runs the color conversion, optional rotation, and maxDim downscale inside the detector's existing isolate, so the UI thread is never blocked by OpenCV work.

import 'package:camera/camera.dart';
import 'package:pose_detection/pose_detection.dart';

final detector = await PoseDetector.create(
  landmarkModel: PoseLandmarkModel.lite, // lite model for higher FPS
);

final cameras = await availableCameras();
final camera = CameraController(
  cameras.first,
  ResolutionPreset.medium,
  enableAudio: false,
  imageFormatGroup: ImageFormatGroup.yuv420,
);
await camera.initialize();

camera.startImageStream((CameraImage image) async {
  final poses = await detector.detectFromCameraImage(
    image,
    // rotation: CameraFrameRotation.cw90, // based on device orientation
    maxDim: 640, // optional in-isolate downscale before inference
  );
  // Process poses...
});

Tips for camera detection:

  • detectFromCameraImage replaces the old packYuv420 + manual cv.cvtColor + cv.rotate dance in one call; no cv.Mat on the UI thread.
  • Pass rotation: so the detector sees upright frames (Android back/front + device orientation logic); on iOS the camera plugin pre-rotates so this is often null.
  • Pass maxDim: (e.g. 640) to downscale in-isolate; the detection model internally resizes to 256px, so full-res frames just waste IPC bandwidth.
  • Use PoseLandmarkModel.lite for fastest real-time performance.
  • Mirror the overlay on the front camera to match CameraPreview's auto-mirrored texture.
  • For advanced use (e.g. reusing a frame across multiple detectors), prepareCameraFrame(...) + detectFromCameraFrame(...) is the underlying two-step API.

See the full example app for a production implementation including orientation handling, mirror handling, and frame throttling.
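
The frame throttling mentioned above can be as simple as dropping frames while a detection is still in flight. A sketch (the _detecting flag and wiring are illustrative, not part of this package):

```dart
import 'package:camera/camera.dart';
import 'package:pose_detection/pose_detection.dart';

bool _detecting = false;

void startThrottledStream(CameraController camera, PoseDetector detector) {
  camera.startImageStream((CameraImage image) async {
    // Drop this frame if the previous detection hasn't finished yet;
    // this keeps latency bounded instead of queueing stale frames.
    if (_detecting) return;
    _detecting = true;
    try {
      final poses = await detector.detectFromCameraImage(image, maxDim: 640);
      // Process poses...
    } finally {
      _detecting = false;
    }
  });
}
```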

Background Processing #

All inference runs automatically in a background isolate: the UI thread is never blocked during detection or landmark extraction. No special configuration is needed; PoseDetector handles isolate management internally.

Advanced Usage #

Multi-person detection #

The detector automatically handles multiple people in a single image:

final List<Pose> results = await detector.detect(imageBytes);
print('Detected ${results.length} people');

for (int i = 0; i < results.length; i++) {
  final Pose pose = results[i];
  print('Person ${i + 1}:');
  print('Bounding box: ${pose.boundingBox}');
  print('Confidence: ${pose.score.toStringAsFixed(2)}');
  print('Landmarks: ${pose.landmarks.length}');
}

Interpreter Pool: The detector maintains a pool of TensorFlow Lite interpreter instances for landmark extraction. Each interpreter adds ~10MB memory overhead.

final detector = PoseDetector(
  interpreterPoolSize: 3,  // Number of interpreter instances
);
  • Default pool size: 1
  • When XNNPACK is enabled (via performanceConfig), pool size is automatically forced to 1 to prevent thread contention
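
With a pool size above 1 (and a non-XNNPACK performanceConfig), independent images can be submitted concurrently so the landmark stage can service several requests in parallel. A sketch, assuming detect calls are safe to issue concurrently as the pool description suggests:

```dart
import 'dart:typed_data';

import 'package:pose_detection/pose_detection.dart';

Future<List<List<Pose>>> detectBatch(
    PoseDetector detector, List<Uint8List> images) {
  // Issue all detect calls at once; results arrive in input order.
  return Future.wait(images.map(detector.detect));
}
```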