Fish stock assessment often involves manual fish counting by taxonomy specialists, which is both time-consuming and costly. We propose FishNet, an automated computer vision system for both taxonomic classification and fish size estimation from images captured with a low-cost digital camera. The system first performs object detection and segmentation using a Mask R-CNN to identify individual fish in images containing multiple fish, possibly of different species. Each fish is then classified by species and its length is predicted using separate machine learning models. To develop the model, we use a dataset of 300,000 hand-labeled images containing 1.2M fish of 163 different species, ranging in length from 10 cm to 250 cm, with additional annotations and quality control methods used to curate high-quality training data. On held-out test sets, our system achieves a 92% intersection over union on the fish segmentation task, an 89% top-1 classification accuracy on single fish species classification, and a 2.3 cm mean absolute error on the fish length estimation task.
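The three metrics reported above can be illustrated with a minimal sketch. The function names and the toy inputs are hypothetical and are not taken from FishNet; masks are assumed to be equal-shape nested lists of 0/1 values.

```python
from typing import Sequence


def mask_iou(pred: Sequence[Sequence[int]], truth: Sequence[Sequence[int]]) -> float:
    """Intersection over union of two binary masks of identical shape."""
    intersection = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Two empty masks are treated as a perfect match (an assumption of this sketch).
    return intersection / union if union else 1.0


def top1_accuracy(predicted: Sequence[str], actual: Sequence[str]) -> float:
    """Fraction of fish whose highest-scoring species label is correct."""
    correct = sum(1 for p, a in zip(predicted, actual) if p == a)
    return correct / len(actual)


def mean_absolute_error(predicted_cm: Sequence[float], actual_cm: Sequence[float]) -> float:
    """Average absolute difference between predicted and measured lengths."""
    total = sum(abs(p - a) for p, a in zip(predicted_cm, actual_cm))
    return total / len(actual_cm)
```

For example, `mask_iou([[1, 1], [0, 0]], [[1, 0], [0, 0]])` compares a two-pixel prediction with a one-pixel ground truth that it fully covers, so one pixel is shared out of two in the union.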
This study investigates a method to guide and control fish schools using virtual fish trained with reinforcement learning. We utilize 2D virtual fish displayed on a screen to overcome technical challenges such as durability and movement constraints inherent in physical robotic agents. To address the lack of detailed behavioral models for real fish, we adopt a model-free reinforcement learning approach. First, simulation results show that reinforcement learning can acquire effective movement policies even when simulated real fish frequently ignore the virtual stimulus. Second, real-world experiments with live fish confirm that the learned policy successfully guides fish schools toward specified target directions. Statistical analysis reveals that the proposed method significantly outperforms baseline conditions, including the absence of stimulus and a heuristic "stay-at-edge" strategy. This study provides an early demonstration of how reinforcement learning can be used to influence collective animal behavior through artificial agents.
Modeling fish growth is an important research topic in ecological and fishery sciences because body weight statistics directly affect the total biomass of fish in a habitat, which in turn affects their population dynamics. Many models of fish growth assume that the fish population in a habitat is homogeneous, meaning that there is no physiological spectrum and, therefore, no size spectrum. Moreover, models that account for the size spectrum are not always analytically tractable. We present novel mathematical models of fish growth in which the body weight of each fish is assumed to follow a von Bertalanffy-type model whose proportionality coefficient, representing the maximum body weight, may differ among individual fish. This probabilistic description introduces the size spectrum into the model while allowing its time-dependent probability density to be obtained explicitly. We also consider a misspecified version and a stochastic version of the model as advanced cases. We apply the first model to real growth data of Plecoglossus altivelis altivelis, a keystone fish species in Japan. The model successfully reproduces the skewed size spectrum of this fish species ove
Fighting fish is a combinatorial configuration introduced by Duchi, Guerrini, Rinaldi and Schaeffer as a new model of branching surfaces that generalizes directed convex polyominoes. We come up with an alternative construction of fighting fish, using a tree structure built on the so-called stem cells of fighting fish. From this perspective, we establish a bijection between ternary trees and fighting fish with a marked strip of cells, which specializes to a direct bijection between left ternary trees and fighting fish. Using these results, we obtain a combinatorial enumeration of the fighting fish of size $n$ by establishing an $(n+1)$-to-2 bijection with the ternary trees with $n$ nodes. We present some additional enumerative results including that fighting fish with a marked tail and horizontally symmetric fighting fish are equinumerous with ordered pairs of ternary trees having a total of a given number of nodes.
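One way to read the $(n+1)$-to-2 correspondence is as the identity $(n+1) F_n = 2 T_n$, where $T_n = \binom{3n}{n}/(2n+1)$ is the number of ternary trees with $n$ nodes and $F_n$ is the number of fighting fish of size $n$. The sketch below assumes that reading and the abstract's size convention; the function names are illustrative only.

```python
from math import comb


def ternary_trees(n: int) -> int:
    """Number of ternary trees with n nodes: C(3n, n) / (2n + 1)."""
    return comb(3 * n, n) // (2 * n + 1)


def fighting_fish(n: int) -> int:
    """Count implied by (n + 1) * F_n = 2 * T_n (assumed reading of the bijection)."""
    return 2 * ternary_trees(n) // (n + 1)


if __name__ == "__main__":
    # Print the first few values under the assumed size convention.
    print([fighting_fish(n) for n in range(1, 6)])
```

Under this reading the first values are 1, 2, 6, 22, 91.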
What are the computational foundations of social grouping? Traditional approaches to this question have focused on verbal reasoning or simple (low-dimensional) quantitative models. In the real world, however, social preferences emerge when high-dimensional learning systems (brains and bodies) interact with high-dimensional sensory inputs during an animal's embodied interactions with the world. A deep understanding of social grouping will therefore require embodied models that learn directly from sensory inputs using high-dimensional learning mechanisms. To this end, we built artificial neural networks (ANNs), embodied those ANNs in virtual fish bodies, and raised the artificial fish in virtual fish tanks that mimicked the rearing conditions of real fish. We then compared the social preferences that emerged in real fish versus artificial fish. We found that when artificial fish had two core learning mechanisms (reinforcement learning and curiosity-driven learning), artificial fish developed fish-like social preferences. Like real fish, the artificial fish spontaneously learned to prefer members of their own group over members of other groups. The artificial fish also spontaneously l
Multiple object tracking (MOT) technology has made significant progress in terrestrial applications, but underwater tracking scenarios remain underexplored despite their importance to marine ecology and aquaculture. In this paper, we present Multiple Fish Tracking Dataset 2025 (MFT25), a comprehensive dataset specifically designed for underwater multiple fish tracking, featuring 15 diverse video sequences with 408,578 meticulously annotated bounding boxes across 48,066 frames. Our dataset captures various underwater environments, fish species, and challenging conditions including occlusions, similar appearances, and erratic motion patterns. Additionally, we introduce Scale-aware and Unscented Tracker (SU-T), a specialized tracking framework featuring an Unscented Kalman Filter (UKF) optimized for non-linear swimming patterns of fish and a novel Fish-Intersection-over-Union (FishIoU) matching that accounts for the unique morphological characteristics of aquatic species. Extensive experiments demonstrate that our SU-T baseline achieves state-of-the-art performance on MFT25, with 34.1 HOTA and 44.6 IDF1, while revealing fundamental differences between fish tracking and terrestrial obj
Fish detection in water-land transfer has significantly contributed to the fishery. However, manual fish detection through crowd collaboration is inefficient, expensive, and insufficiently accurate. To further enhance water-land transfer efficiency, improve detection accuracy, and reduce labor costs, this work designs a new type of lightweight, plug-and-play edge intelligent vision system that automatically conducts fast fish detection with a high-speed camera. Moreover, a novel similarity-aware vision Transformer for fast fish detection (FishViT) is proposed to identify, onboard, every single fish in a dense group of similar fish. Specifically, a novel similarity-aware multi-level encoder is developed to enhance multi-scale features in parallel, thereby yielding discriminative representations for fish of varying size. Additionally, a new soft-threshold attention mechanism is introduced, which not only effectively eliminates background noise from images but also accurately recognizes both the edge details and overall features of different similar fish. 85 challenging video sequences with high frame rate and high resolution are collected to establish a benchmark from real fish w
Fish tracking plays a vital role in understanding fish behavior and ecology. However, existing tracking methods face challenges in accuracy and robustness due to morphological changes of fish, occlusion, and complex environments. This paper proposes FishMOT (Multiple Object Tracking for Fish), a novel fish tracking approach combining object detection and IoU matching, comprising a basic module, an interaction module, and a refind module. The basic module performs target association based on the IoU of detection boxes between successive frames to deal with morphological changes of fish; the interaction module combines the IoU of detection boxes and the IoU of fish entities to handle occlusions; the refind module uses spatio-temporal information to overcome tracking failures resulting from missed detections by the detector in complex environments. FishMOT reduces computational complexity and memory consumption since it does not require complex feature extraction or identity assignment per fish, and does not need a Kalman filter to predict the detection boxes of successive frames. Experimental results demonstrate FishMOT outperforms state-of-the-art multi-object trac
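Frame-to-frame association by IoU, as described for the basic module, can be sketched generically. This is not FishMOT's implementation: the greedy matching strategy, the `min_iou` threshold, and all names are assumptions made for illustration, and the interaction and refind modules are not modeled.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) corner coordinates


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def associate(tracks: Dict[int, Box], detections: List[Box],
              min_iou: float = 0.3) -> Dict[int, int]:
    """Greedily match previous-frame tracks to current detections by IoU.

    Returns {track_id: detection_index}; unmatched tracks are omitted.
    """
    candidates = sorted(
        ((box_iou(box, det), track_id, j)
         for track_id, box in tracks.items()
         for j, det in enumerate(detections)),
        reverse=True,
    )
    matches: Dict[int, int] = {}
    used = set()
    for score, track_id, j in candidates:
        if score < min_iou:
            break  # remaining pairs overlap too little to be the same fish
        if track_id in matches or j in used:
            continue
        matches[track_id] = j
        used.add(j)
    return matches
```

Because no motion model is involved, a match relies purely on the overlap between a fish's box in consecutive frames, which is consistent with the abstract's point that no Kalman filter is needed.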
Monocular absolute 3D fish pose estimation allows for efficient fish length measurement in the longline fisheries, where fishes are under severe deformation during the catching process. This task is challenging since it requires locating absolute 3D fish keypoints based on a short monocular video clip. Unlike related works, which either require expensive 3D ground-truth data and/or multiple-view images to provide depth information, or are limited to rigid objects, we propose a novel frame-based method to estimate the absolute 3D fish pose and fish length from a single-view 2D segmentation mask. We first introduce a relative 3D fish template. By minimizing an objective function, our method systematically estimates the relative 3D pose of the target fish and fish 2D keypoints in the image. Finally, with a closed-form solution, the relative 3D fish pose can help locate absolute 3D keypoints, resulting in the frame-based absolute fish length measurement, which is further refined based on the statistical temporal inference for the optimal fish length measurement from the video clip. Our experiments show that this method can accurately estimate the absolute 3D fish pose and further measu
Given a sufficiently large training dataset, it is relatively easy to train a modern convolutional neural network (CNN) as a required image classifier. However, for the task of fish classification and/or fish detection, if a CNN was trained to detect or classify particular fish species in particular background habitats, the same CNN exhibits much lower accuracy when applied to new/unseen fish species and/or fish habitats. Therefore, in practice, the CNN needs to be continuously fine-tuned to improve its classification accuracy to handle new project-specific fish species or habitats. In this work we present a labelling-efficient method of training a CNN-based fish detector (the Xception CNN was used as the base) on a relatively small number (4,000) of project-domain underwater fish/no-fish images from 20 different habitats. Additionally, 17,000 known-negative (that is, containing no fish) general-domain (VOC2012) above-water images were used. Two publicly available fish-domain datasets supplied an additional 27,000 above-water and underwater positive/fish images. By using this multi-domain collection of images, the trained Xception-based binary (fish/not-fish) classifier achieved 0.17% fa
Fish migration is a dynamic phenomenon observed in many surface water bodies on the earth, yet it remains insufficiently understood. In particular, the biological mechanism behind fish migration is not fully understood. Moreover, its observation is often conducted visually and hence manually, raising questions about the accuracy and interpretation of the sampled data. We address these two issues, mechanism and observation, using a recently developed mathematical model. The results obtained in this short paper show that fish migration can be characterized through a minimization principle, and they evaluate the error of its manual observations. The minimization principle we hypothesize is an optimal control problem where the migrating fish population dynamically changes its size and fluctuation. We numerically investigate alternating and intensive observation schemes as case studies, demonstrating that in some realistic conditions the estimate of total fish count is not reliable. We believe that this paper contributes to a deeper understanding of fish migration.
Robotic fish is one of the most promising directions of the new generation of underwater vehicles. Traditional biomimetic fish often mimic fish joints using tandem components like servos, which leads to increased volume, weight and control complexity. In this paper, a new double-joint robotic fish using a composite linkage was designed, where the propulsion mechanism transforms the single-degree-of-freedom rotation of the motor into a double-degree-of-freedom coupled motion, namely caudal peduncle translation and caudal fin rotation. Motion analysis of the propulsion mechanism demonstrates its ability to closely emulate the undulating movement observed in carangiform fish. Experimental results further validate the feasibility of the proposed propulsion mechanism. To improve propulsion efficiency, an analysis is conducted to explore the influence of swing angle amplitude and swing frequency on the swimming speed of the robotic fish. This examination establishes a practical foundation for future research on such robotic fish systems.
In open water, social fish gather to form schools, in which fish generally align with each other. In this work, we study how this social behavior evolves when perturbed by artificial obstacles. We measure the collective behavior of a group of zebrafish in the presence of a periodic array of pillars. When pillar density is low, the fish regroup with a typical inter-distance and a well-polarized state with parallel orientations, similar to their behavior in open water conditions. Above a critical density of pillars, their social interactions, which are mostly based on vision, are screened and the fish spread randomly through the aquarium, orienting themselves along the free axes of the pillar lattice. The abrupt transition from natural to artificial orientation happens when the pillar inter-distance is comparable to the social distance of the fish, i.e., their most probable inter-distance. We develop a stochastic model of the relative orientation between fish pairs, taking into account alignment, anti-alignment and tumbling, from a distribution biased by the environment. This model provides a good description of the experimental probability distribution of the relative orientation be
In recent years, multiplexed Fluorescence In Situ Hybridization (FISH) or FISH-omics methods have rapidly expanded, enabling the quantification of chromatin organization in single cells, often in conjunction with measurements of RNA and protein. These approaches have deepened our understanding of how 3D chromosome architecture relates to transcriptional activity and cell states in health and disease. Despite these advances, results from Chromatin Tracing FISH-omics experiments remain challenging to share, reuse, and analyze due to the absence of standardized data exchange specifications. Building on the release of microscopy metadata standards, we introduce the FISH Omics Format-Chromatin Tracing (FOF-CT), a community-developed standard for processed results from diverse imaging modalities. We describe the FOF-CT file format and present a curated collection of datasets deposited in the 4DN Data Portal and the OME Image Data Resource (IDR). We also highlight their potential for reuse, integration, and modeling by outlining example analysis pipelines and illustrating biological insights enabled by standardized, FAIR-compliant Chromatin Tracing datasets. While this manuscript focuses
The size structure of fish-communities is an emergent high-level property of marine food webs responsive to changes in structure and function. To measure this food web property using data arising from routine fisheries surveys, a simple metric known as Typical Length has been proposed as more suitable than the Large Fish Indicator, which has been highly engineered to be responsive to fishing pressure. Typical Length avoids the inherent dependence of the Large Fish Indicator on a parameter that requires case-by-case adjustments. Using IBTS survey time series for five spatial subdivisions of the Greater North Sea, we show that the Typical Length can provide information equivalent to the Large Fish Indicator when fishing is likely the strongest driver, but differences can also arise. In this example, Typical Length exhibits smaller random fluctuations ("noise") than the Large Fish Indicator. Typical Length is also more adaptable than the Large Fish Indicator and can be easily applied to monitor pelagic fish in addition to demersal fish, and together with information on the potential growth of the fish community, a proxy of which can be derived from the Mean Maximum Length indicator, i
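The contrast between the two indicators can be made concrete with a small sketch. It assumes the commonly used definitions, which the abstract itself does not spell out: Typical Length as the biomass-weighted geometric mean length, and the Large Fish Indicator as the fraction of biomass above a length threshold. The function names and the `threshold_cm` argument are illustrative.

```python
from math import exp, log
from typing import Sequence


def typical_length(lengths_cm: Sequence[float], biomass: Sequence[float]) -> float:
    """Biomass-weighted geometric mean length (assumed definition)."""
    total = sum(biomass)
    weighted_log = sum(b * log(length) for length, b in zip(lengths_cm, biomass))
    return exp(weighted_log / total)


def large_fish_indicator(lengths_cm: Sequence[float], biomass: Sequence[float],
                         threshold_cm: float) -> float:
    """Fraction of total biomass in fish longer than threshold_cm (assumed definition)."""
    total = sum(biomass)
    large = sum(b for length, b in zip(lengths_cm, biomass) if length > threshold_cm)
    return large / total
```

Note that only `large_fish_indicator` takes a threshold argument, which corresponds to the parameter the abstract describes as needing case-by-case adjustment.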
Accurate phenotypic analysis in aquaculture breeding necessitates the quantification of subtle morphological phenotypes. Existing datasets suffer from limitations such as small scale, limited species coverage, and inadequate annotation of keypoints for measuring refined and complex morphological phenotypes of fish body parts. To address this gap, we introduce FishPhenoKey, a comprehensive dataset comprising 23,331 high-resolution images spanning six fish species. Notably, FishPhenoKey includes 22 phenotype-oriented annotations, enabling the capture of intricate morphological phenotypes. Motivated by the nuanced evaluation of these subtle morphologies, we also propose a new evaluation metric, Percentage of Measured Phenotype (PMP). It is designed to assess the accuracy of individual keypoint positions and is highly sensitive to the phenotypes measured using the corresponding keypoints. To enhance keypoint detection accuracy, we further propose a novel loss, Anatomically-Calibrated Regularization (ACR), that can be integrated into keypoint detection models, leveraging biological insights to refine keypoint localization. Our contributions set a new benchmark in fish phenotype analysis
We address the question of how to connect predictions by hydrodynamic models of how sea lice move in water to observable measures that count the number of lice on each fish in a cage in the water. This question is important for management and regulation of aquacultural practice that tries to maximise food production and minimise risk to the environment. We do this through a simple rule-based model of interaction between sea lice and caged fish. The model is simple: sea lice can attach and detach from a fish. The model has a novel feature, encoding what is known as a master equation producing a time-series of distributions of lice on fish that one might expect to find if a cage full of fish were placed at any given location. To demonstrate how this works, and to arrive at a rough estimate of the interaction rates, we fit a simplified version of the model with three free parameters to publicly available data about an experiment with sentinel cages in Loch Linnhe in Scotland. Our construction, coupled to the hydrodynamic models driven by surveillance data from industrial farms, quantifies the environmental impact as: what would the infection burden look like in a notional cage at any
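The idea of a master equation over lice counts can be sketched with a deliberately reduced attach/detach process. This is not the paper's three-parameter model: the constant attachment rate `attach`, the per-louse detachment rate `detach`, the cap `n_max`, and the forward-Euler time stepping are all assumptions chosen for illustration.

```python
from typing import List


def step(p: List[float], attach: float, detach: float, dt: float) -> List[float]:
    """One forward-Euler step of the master equation for lice per fish.

    p[n] is the probability that a fish carries n lice. Probability flows
    n -> n + 1 at rate `attach` and n -> n - 1 at rate `detach * n`.
    """
    n_max = len(p) - 1
    new = list(p)
    for n in range(n_max + 1):
        if n < n_max:  # attachment is blocked at the cap so probability is conserved
            flow = attach * p[n] * dt
            new[n] -= flow
            new[n + 1] += flow
        if n > 0:
            flow = detach * n * p[n] * dt
            new[n] -= flow
            new[n - 1] += flow
    return new


def evolve(attach: float, detach: float, n_max: int, dt: float, steps: int) -> List[float]:
    """Evolve the distribution starting from fish that carry no lice."""
    p = [1.0] + [0.0] * n_max
    for _ in range(steps):
        p = step(p, attach, detach, dt)
    return p


def mean_lice(p: List[float]) -> float:
    """Expected number of lice per fish under distribution p."""
    return sum(n * prob for n, prob in enumerate(p))
```

In this reduced process the mean burden relaxes toward `attach / detach`, which gives a quick sanity check on the integration; recording `p` at each step would yield a time series of distributions of the kind the abstract describes.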
The focus of this paper is a key component of a methodology for understanding, interpolating, and predicting fish movement patterns based on spatiotemporal data recorded by spatially static acoustic receivers. Unlike GPS trackers which emit satellite signals from the animal's location, acoustic receivers are akin to stationary motion sensors that record movements within their detection range. Thus, for periods of time, fish may be far from the receivers, resulting in the absence of observations. The lack of information on the fish's location for extended time periods poses challenges to the understanding of fish movement patterns, and hence, the identification of proper statistical inference frameworks for modeling the trajectories. As the initial step in our methodology, in this paper, we devise and implement a simulation-based imputation strategy that relies on both Markov chain and random-walk principles to enhance our dataset over time. This methodology will be generalizable and applicable to all fish species with similar migration patterns or data with similar structures due to the use of static acoustic receivers.
A fish farm is an area where fish are raised and bred for food. Fish farm environments support the care and management of seafood within a controlled environment. Over the past few decades, there has been a remarkable increase in the calorie intake of protein attributed to seafood. Along with this, there are significant opportunities within the fish farming industry for economic development. Determining fish diseases, monitoring aquatic organisms, and examining imbalances in the water are some key factors that require precise observation to determine the accuracy of the acquired data. Similarly, due to the rapid expansion of aquaculture, new technologies are constantly being implemented in this sector to enhance efficiency. However, the existing approaches have often failed to provide an efficient method of farming fish. This work sets aside traditional approaches and opens up new dimensions for accurate analysis by adopting a distributed ledger technology. Our work analyses the current state-of-the-art of fish farming and proposes a fish farm ecosystem that relies on a private-by-design architecture based on the Hyperledger Fabric private-permissioned
While mathematical models, in particular self-propelled particle (SPP) models, capture many of the observed properties of large fish schools, they do not always capture the interactions of smaller shoals. Nor do these models tend to account for the observation that, when swimming alone or in smaller groups, many species of fish use intermittent locomotion, often referred to as burst and coast or burst and glide. Recent empirical studies have suggested that burst and glide movement is indeed pivotal to the social interactions of individual fish. In this paper, we propose a model of social burst and glide motion by combining a well-studied model of neuronal dynamics, the FitzHugh-Nagumo model, with a model of fish motion. We begin by showing that the model can capture the motion of a single fish swimming down a channel. By then extending to a two fish model, where visual stimuli of the position of the other fish affect the internal burst or glide state of the fish, we find that our model captures a rich set of swimming dynamics found in many species of fish. These include: leader-follower behaviour; periodic changes in leadership; apparently random (i.e. chaotic) leadership change; a
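The neuronal component can be illustrated with the standard FitzHugh-Nagumo equations, $\dot v = v - v^3/3 - w + I$ and $\dot w = \varepsilon (v + a - b w)$. The sketch below covers only this component, not the coupling to fish motion or to a second fish. The parameter values, the forward-Euler integration, and the reading of an upward crossing of `threshold` as a "burst" are assumptions made for illustration.

```python
from typing import List


def simulate_fhn(stimulus: float, steps: int = 20000, dt: float = 0.01,
                 a: float = 0.7, b: float = 0.8, eps: float = 0.08,
                 v0: float = -1.2, w0: float = -0.625) -> List[float]:
    """Integrate the FitzHugh-Nagumo equations with forward Euler.

    Returns the trace of the fast variable v. The default initial state is
    close to the resting point of the unstimulated system.
    """
    v, w = v0, w0
    trace = []
    for _ in range(steps):
        dv = v - v ** 3 / 3.0 - w + stimulus
        dw = eps * (v + a - b * w)
        v += dt * dv
        w += dt * dw
        trace.append(v)
    return trace


def count_bursts(trace: List[float], threshold: float = 1.0) -> int:
    """Count upward crossings of the threshold, read here as burst onsets."""
    return sum(1 for prev, cur in zip(trace, trace[1:]) if prev < threshold <= cur)
```

With these parameters a zero stimulus leaves the system at rest (a continuous glide in this reading), while a constant stimulus of 0.5 produces repeated excursions (intermittent bursts). In a two-fish setting, the stimulus would depend on the position of the other fish, as the abstract describes.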