Internet2
Site Index | Internet2 Searchlight |
Membership | Communities | Services | Projects | Tools | Events | Newsroom | About
 | Home
End-to-End Performance Initiative
> About Us
> Staff
> Contact
Resources
> Tools
> Presentations
> Library
> Case Studies


Network Performance
> perfSONAR-PS
> BWCTL
> OWAMP
> NDT
> Thrulay
> Workshops
> NPToolkit
> MP Directory
> RPM
> Phoebus


Community Engagement
> Working Groups
> Collaborations
Making e-VLBI Happen
 
A Case Study for Bandwidth Test Controller (BWCTL)
 

Do you have to create a tuned path for a real-time point-to-point data transfer? Like many, you may want to use Iperf to establish the expected performance on your existing path. Iperf can help determine the achievable (or available) bandwidth between a pair of hosts.

It is often useful to test to multiple points along a network path to determine the network characteristics along that path. Typically, users who want to conduct path decomposition must directly contact the network or system administrators who control the hosts along the path. Iperf tests can consume a great deal of bandwidth and, as a result, many network administrators are wary about to whom, when, and for how long they will allow such tests on their networks. Each administrator must either run half of the test for the user or provide a user account on the host, which is untenable for all but well-known testers.

Most network paths involve multiple administrators. This is made even more complicated when the test locations are in different countries (even different continents, as in the case of radio astronomy) and backbone networks. These hurdles make path decomposition difficult in practice.

Internet2’s piPEs, for which Bandwidth Test Controller (BWCTL) was created, runs a full mesh of regularly scheduled active tests on the 11-node Abilene backbone (to collect data on loss, jitter, throughput, and one- way latency).

Because piPEs also allows on demand tests, the designers realized that piPEs must have a method to control when both regularly scheduled tests were performed and allow on-demand tests users need (without allowing those tests to interfere with the regular testing). These are requirements beyond the scope for which Iperf was designed; BWCTL, born of that need, is a command line client application and a scheduling and policy daemon that executes the Iperf command line program (although this may be changed in the future).

Case In Point

On March 25, 2004, the first successful real-time international transmission and processing of VLBI data was conducted between MIT Haystack Observatory's Westford antenna in Massachusetts and the Onsala Space Observatory antenna in Sweden. Data from the Onsala observatory were transmitted over the GEANT network to New York and then over Internet2's Abilene Network to the Haystack Observatory. Although the data streamed at relatively low bandwidth (32 Mbps), it was still a milestone demonstration of real-time simultaneous correlation of data from two antennas. Using the BWCTL component of Internet2’s E2E piPEs and intermediate BWCTL beacons in Washington and London, researchers at Haystack could run tests between them to assist in diagnosing transcontinental network problems during the experiment.

At the time, Dr. David Lapsley was a Research Engineer at MIT Haystack Observatory in Massachusetts. Lapsley was working to enable radio astronomers practicing VLBI to shift vast amounts of data in short periods of time across research networks spanning the globe, using high-speed networks. This “e-VLBI” will allow astronomers at various locations to tune their radio telescopes in real-time, to catch interesting events as they are identified by other telescope locations.

Lapsley worked with VLBI researchers in the United States and around the world (Sweden, The Netherlands, Norway, Japan, Chile and elsewhere). The data collected is transported on a wide variety of networks, including Abilene, GEANT, APAN, SURFnet, and NORDUnet, among others.

In trying to optimize the paths between Haystack and Onsala telescope locations, Lapsley discovered that he was not getting the performance he needed to make e-VLBI more efficient than the existing system of downloading data onto tapes and shipping them to the nearest correlator. Lapsley heard about BWCTL at an Internet2 Member Meeting and joined the early adopters group.

For several of the telescopes involved in the testing, Lapsley did not have root privileges or administrative access needed to run standard Iperf tests; also, many of the machines to which he wanted to test were heavily scheduled. By having the remote locations download BWCTL and install it on a server at their location, Lapsley could schedule tests whenever machines were available.

Using BWCTL, he was able to pinpoint a packet loss problem to local congestion in the Haystack Observatory area; as a result, this bottleneck link was updated in May 2004. BWCTL, which he called“an excellent diagnostic tool,” allowed Lapsley to perform partial path analysis, quickly eliminating portions of the path from consideration.

When setting up similar tests with Kashima Space Research Center (in Japan) in May 2004, he discovered that the end-to-end bandwidth was ~1 Mbps (which was not surprising, given that they were using 100 Mbps "commodity" access at each end). They decided to cache data in Washington, where they have a server, connected at 1 Gbps to ISI-E, MAX, and Abilene, that is able to get high bandwidth to Tokyo across Transpac. They were very surprised to find that the bandwidth was less than 4 Mbps using eight streams.

When faced with the usual situation of trying to isolate the problem using pairwise Iperf testing between nodes and then trying to construct an overall picture of the network status, a tedious, time consuming and often error prone situation, they decided instead to "instrument" their test network with BWCTL:

http://web.haystack.mit.edu/staff/dlapsley/tsev7.html.

Although the website is still basic, it is the first time they have had a good overall view of what is going on in the network and how it changes with time. Lapsley reported that it took only one afternoon to setup the BWCTL partial mesh using some simple scripts that he developed for the Onsala-Haystack tests.

Lapsley added that, via BWCTL, they were able to isolate the cause of the performance problem to a particular network segment. Some additional detective work by his colleagues isolated the problem to a faulty NIC in a server on this segment. By isolating the problem, the NIC was replaced just in time for the e-VLBI experiment. In collaboration with their Japanese colleagues, Lapsley’s team was able to reduce the time for a particular type of e-VLBI experiment by a factor of > 4.

In the future, VLBI researchers will use this tool to optimize the time of day for non-real-time bulk data transfers and identify the most aggressive application usable at any given time of day for real-time transfers.

Acknowledgements

MIT Haystack Observatory, in collaboration with the MIT Laboratory for Computer Science and Artificial Intelligence Laboratory (CSAIL), has recently been awarded a grant by NSF to develop new IP protocols specifically tailored to applications such as e-VLBI.

MIT Haystack gratefully acknowledges the support, assistance, and suggestions of Masaki Hirabaru and Yasuhiro Koyama (National Institute of Information and Communications Technology), Rüdiger Haas (Chalmers Onsala Space Observatory), Tom Lehman (University of Southern California, Information Sciences Institute) and staff of the APAN Tokyo XP.


PDF Version of Document

More Information

 

REVIEW THIS ARTICLE
 
Please share your comments; if you have any questions be sure to include your email address.
 
Read Other Reviews

© 1996 - 2008 Internet2 - All rights reserved | Terms of Use | Privacy | Contact Us
1000 Oakbrook Drive, Suite 300, Ann Arbor MI 48104 | Phone: +1-734-913-4250