CUE-Net: Violence Detection Model

Model Description

CUE-Net (CLIP-based UniFormerV2 Enhanced Network) là mô hình phát hiện bạo lực từ video giám sát, được huấn luyện trên bộ dữ liệu RWF-2000.

Architecture

  • Backbone: CLIP ViT-L/14-336
  • Framework: UniFormerV2
  • Input: 64 frames × 336 × 336
  • Classes: 2 (Fight, NonFight)
  • Parameters: ~354M

Performance

Metric Score
Accuracy 89.50%
F1-Score 89.48%

Usage

from slowfast.models.build import build_model
from slowfast.config.defaults import get_cfg

cfg = get_cfg()
cfg.merge_from_file("config.yaml")
model = build_model(cfg)

# Load checkpoint
checkpoint = torch.load("cuenet_rwf2000_epoch51.pyth")
model.load_state_dict(checkpoint["model_state"])

Training Details

  • Optimizer: AdamW
  • Learning rate: 4e-4
  • Epochs: 51
  • Batch size: 2-4
Downloads last month
3
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support