TY - GEN
T1 - A Video-Based End-to-end Pipeline for Non-nutritive Sucking Action Recognition and Segmentation in Young Infants
AU - Zhu, Shaotong
AU - Wan, Michael
AU - Hatamimajoumerd, Elaheh
AU - Jain, Kashish
AU - Zlota, Samuel
AU - Kamath, Cholpady Vikram
AU - Rowan, Cassandra B.
AU - Grace, Emma C.
AU - Goodwin, Matthew S.
AU - Hayes, Marie J.
AU - Schwartz-Mette, Rebecca A.
AU - Zimmerman, Emily
AU - Ostadabbas, Sarah
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.
PY - 2023
Y1 - 2023
N2 - We present an end-to-end computer vision pipeline to detect non-nutritive sucking (NNS)—an infant sucking pattern with no nutrition delivered—as a potential biomarker for developmental delays, using off-the-shelf baby monitor video footage. One barrier to clinical (or algorithmic) assessment of NNS stems from its sparsity, requiring experts to wade through hours of footage to find minutes of the relevant activity. Our NNS activity segmentation algorithm tackles this problem by identifying periods of NNS with high certainty—up to 94.0% average precision and 84.9% average recall across 30 heterogeneous 60 s clips, drawn from our manually annotated NNS clinical in-crib dataset of 183 h of overnight baby monitor footage from 19 infants. Our method is based on an underlying NNS action recognition algorithm, which uses spatiotemporal deep learning networks and infant-specific pose estimation, achieving 94.9% accuracy in binary classification of 960 2.5 s balanced NNS vs. non-NNS clips. Tested on our second, independent, and public NNS in-the-wild dataset, NNS recognition classification reaches 92.3% accuracy, and NNS segmentation achieves 90.8% precision and 84.2% recall. Our code and the manually annotated NNS in-the-wild dataset can be found at https://github.com/ostadabbas/NNS-Detection-and-Segmentation. Supported by MathWorks and NSF-CAREER Grant #2143882.
AB - We present an end-to-end computer vision pipeline to detect non-nutritive sucking (NNS)—an infant sucking pattern with no nutrition delivered—as a potential biomarker for developmental delays, using off-the-shelf baby monitor video footage. One barrier to clinical (or algorithmic) assessment of NNS stems from its sparsity, requiring experts to wade through hours of footage to find minutes of the relevant activity. Our NNS activity segmentation algorithm tackles this problem by identifying periods of NNS with high certainty—up to 94.0% average precision and 84.9% average recall across 30 heterogeneous 60 s clips, drawn from our manually annotated NNS clinical in-crib dataset of 183 h of overnight baby monitor footage from 19 infants. Our method is based on an underlying NNS action recognition algorithm, which uses spatiotemporal deep learning networks and infant-specific pose estimation, achieving 94.9% accuracy in binary classification of 960 2.5 s balanced NNS vs. non-NNS clips. Tested on our second, independent, and public NNS in-the-wild dataset, NNS recognition classification reaches 92.3% accuracy, and NNS segmentation achieves 90.8% precision and 84.2% recall. Our code and the manually annotated NNS in-the-wild dataset can be found at https://github.com/ostadabbas/NNS-Detection-and-Segmentation. Supported by MathWorks and NSF-CAREER Grant #2143882.
KW - Action recognition
KW - Action segmentation
KW - Non-nutritive sucking
KW - Optical flow
KW - Temporal convolution
UR - https://www.scopus.com/pages/publications/85174746261
U2 - 10.1007/978-3-031-43895-0_55
DO - 10.1007/978-3-031-43895-0_55
M3 - Conference contribution
AN - SCOPUS:85174746261
SN - 9783031438943
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 586
EP - 595
BT - Medical Image Computing and Computer Assisted Intervention – MICCAI 2023 - 26th International Conference, Proceedings
A2 - Greenspan, Hayit
A2 - Greenspan, Hayit
A2 - Madabhushi, Anant
A2 - Mousavi, Parvin
A2 - Salcudean, Septimiu
A2 - Duncan, James
A2 - Syeda-Mahmood, Tanveer
A2 - Taylor, Russell
PB - Springer Science and Business Media Deutschland GmbH
T2 - 26th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2023
Y2 - 8 October 2023 through 12 October 2023
ER -