原文
✅ Can it pick the right tool with correct parameters?
✅ Can it classify "masked person at night" as Critical?
✅ Can it resist prompt injection in event descriptions?
✅ Can it deduplicate the same person across 3 cameras?
✅ Can it maintain context across multi-turn security conversations?