- PyTorch installed and configured with a decent GPU
- Dataset Preparation from Berkeley Deep Drive to shortlist 1000 good quality images (900 for train and 100 for validation).
- Fine-tuned a FasterRCNN model with specific target labels
- Evaluate by issuing sample queries to a test video taken from the same dataset
- Sample query: "Sample every 3 seconds and get all timestamps where there are more than 2 cars, 1 sign, 1 pedestrian"
- Momentary change in border color to green indicates the frame has satisfied the query