Long-Term Video Object Detection And Tracking In Collaborative Learning Environments