Temporal aggregation of vision-language features for high-accuracy fish classification in automated monitoring
Silva Martins, J. R., Bárðarson, H., Guðbrandsson, J., Einarsson, H. · Ecological Informatics · 2025
Aggregates temporal vision-language features from underwater video to classify fish species in automated monitoring streams, combining per-frame embeddings with temporal pooling to handle the noise and occlusion that limit frame-by-frame classifiers in deployed systems. Published in Ecological Informatics (2025) as part of the lab's ongoing computer-vision work with the Marine and Freshwater Research Institute.