Skip to main navigation Skip to search Skip to main content

A data throughput prediction and optimization service for widely distributed many-task computing

  • Louisiana State University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In this paper, we present the design and implementation of a network throughput prediction and optimization service for many-task computing in widely distributed environments. This service uses multiple parallel TCP streams to improve the end-to-end throughput of data transfers. A novel mathematical model is used to decide the number of parallel streams to achieve best performance. This model can predict the optimal number of parallel streams with as few as three prediction points. We implement this new service in the Stork data scheduler, where the prediction points can be obtained using Iperf and GridFTP samplings. Our results show that the prediction cost plus the optimized transfer time is much less than the unoptimized transfer time in most cases.

Original languageEnglish
Title of host publicationProceedings of the 2nd ACM Workshop on Many-Task Computing on Grids and Supercomputers 2009, MTAGS '09
DOIs
StatePublished - 2009
Event2nd ACM Workshop on Many-Task Computing on Grids and Supercomputers 2009, MTAGS '09 - Portland, OR, United States
Duration: Nov 16 2009Nov 16 2009

Publication series

NameProceedings of the 2nd ACM Workshop on Many-Task Computing on Grids and Supercomputers 2009, MTAGS '09

Conference

Conference2nd ACM Workshop on Many-Task Computing on Grids and Supercomputers 2009, MTAGS '09
Country/TerritoryUnited States
CityPortland, OR
Period11/16/0911/16/09

Keywords

  • Many-task computing
  • Modeling
  • Optimization
  • Parallel TCP streams
  • Prediction
  • Scheduling
  • Stork

Fingerprint

Dive into the research topics of 'A data throughput prediction and optimization service for widely distributed many-task computing'. Together they form a unique fingerprint.

Cite this