Our trained checkpoints in the paper "On the Consistency of Video Large Language Models in Temporal Comprehension".