Home > Published Issues > 2026 > Volume 21, No. 2, 2026 >
JCM 2026 Vol.21(2): 309-321
Doi: 10.12720/jcm.21.2.309-321

Long-Term Production-Scale Empirical Characterization of SRT Transport Metrics for Reinforcement Learning-Based Adaptive Streaming Control

Ahmed F. K. Koysha*, Ali Güneş, Selçuk Şener, and Ferdi Sönmez*
Department of Computer Engineering, Istanbul Aydin University, Istanbul, Türkiye
Email: afouadkoysha@stu.aydin.edu.tr (A.F.K.K.); aligunes@aydin.edu.tr (A.G.); selcuksener@aydin.edu.tr (S.Ş.); ferdisonmez@aydin.edu.tr (F.S.)
*Corresponding author

Manuscript received November 7, 2025; revised December 7, 2025; accepted January 4, 2026; published April 24, 2026.

Abstract—This study presents the first long-term, production-scale empirical characterization of Secure Reliable Transport (SRT) protocol metrics for Reinforcement Learning (RL)-based adaptive streaming control in low-latency live video applications. Production data spanning 722 hours (approximately 30 days) were collected from an IEEE 802.11ac wireless network, yielding 64,340 aligned multivariate samples. These data were analyzed using statistical methods and machine learningbased anomaly detection. Key findings include: (1) strong positive correlation between average Round-Trip Time (RTT) and jitter (r = 0.912, 95% CI: 0.909–0.915), indicating coupled latency dynamics suitable for state-space reduction; (2) Isolation Forest detected 4.8× more anomalies than the Interquartile Range (IQR) method (6,425 vs. 1,339), demonstrating superior multivariate pattern recognition; (3) three distinct anomaly regimes were identified: congestiondriven, timing-driven, and bandwidth-limited; (4) a datadriven three-level operational monitoring framework (Green/Yellow/Red) was derived for five SRT metrics using convergent statistical evidence; and (5) comprehensive RL validation demonstrated effective reward signal translation: rule-based (+82%) and Q-learning (+54%) controllers outperformed static baselines at hourly resolution; at 5- minute resolution, Q-learning achieved +138.1% improvement with 98% oscillation reduction; in closed-loop emulation, Q-learning achieved +81.3% improvement with minimal red-zone violations (0.23%). These findings establish empirical foundations for future RL-based SRT controllers, bridging network measurement research and adaptive streaming development. Limitations regarding single-network generalizability and recalibration for heterogeneous environments are discussed.


Keywords—SRT protocol, reinforcement learning, anomaly detection, low-latency video streaming, multivariate analysis, Isolation Forest, adaptive bitrate control, operational thresholds, closed-loop validation, temporal resolution analysis

Cite: Ahmed F. K. Koysha, Ali Güneş, Selçuk Şener, and Ferdi Sön, “Long-Term Production-Scale Empirical Characterization of SRT Transport Metrics for Reinforcement Learning-Based Adaptive Streaming Control," Journal of Communications, vol. 21, no. 2, pp. 309-321, 2026.

Copyright © 2026 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Article Metrics in Dimensions