Content Moderation of Surveillance Search Queries Using Fine-Tuned Generative LLMs
We study how small, fine-tuned generative large language models (LLMs) can moderate free-text search queries for surveillance video systems. Four open models (Llama 3.2 1B, Llama 3.2 3B, Qwen 2.5 0.5B, and Qwen 2.5 1.5B) are trained on six subtasks: safety judgement, problem detection, target detection, span detection, safety explanation, and rephrase. The training combines a public toxicity set with abou