PingER: a lightweight active end-to-end network monitoring tool/project
Prepared by Les Cottrell, SLAC,  for the
RIPE 46 Meeting, Amsterdam Sept 2003
www.slac.stanford.edu/grp/scs/net/talk03/ripe-sep03.ppt

Methodology
Use ubiquitous ping
1 ping to prime caches,
Each 30 minutes: from Monitoring site to target: by default send10x100Byte pkts then 10x1000Byte pkts
Record loss & RTT, (+ reorders, duplicates)
Derive throughput, jitter, unreachability …

Architecture

Countries Monitored

Recent additions
Added hosts in Macedonia, Serbia/Montenegro, Belarus, Turkey, Armenia, Mexico and Azerbaijan, Tajikistan, Turkeministan, Kyrgyzstan
Contacts
Working with contacts in Vietnam, the Philippines, Albania, and Tunisia
Looking for contacts in Cuba, Kenya, Algeria and South Africa, Uganda
Working with Iran site to become monitor host
Increased hosts monitored from CERN to give better European view
Now monitoring 60 countries

Visualization
Keep it simple, enable user to do their own by making data available
Tables
Time series (www-iepm.slac.stanford.edu/cgi-wrap/pingtable.pl):
select metric (loss, RTT etc.), time ticks, packet size, aggregations from/to, etc.
Color code numbers, provide sort, drill down to graphs, download data (TSV), statistical summaries
Monitoring site vs. Remote sites (www-iepm.slac.stanford.edu/cgi-wrap/table.pl):
Select metric, region aggregations
Drill down to time series, download data
Graphs
Select source(s)/destination(s), metric, time window, SQL selects, graph type

Publish information
www.slac.stanford.edu/cgi-wrap/pingtable.pl => tabular reports
Data accessible from MonaLisa
Implementing web services access prototype
Includes: PingER, IEPM-BE, RIPE-tt, I2 E2Epi OWAMP
Use GGF/NMWG schema/profile, e.g.
path.delay.roundTrip

PingER Benefits
Provides quantitative historical (> 8yrs) and near real-time information
Aggregate by regions, affiliations etc.
How bad is performance to various regions, rank countries?
Trends: who is catching up, falling behind, is progress being made?
Compare vs. economic, development indicators etc.
Use for trouble shooting setting expectations, identify needed upgrades, choosing a provider, presenting to policy makers, funding bodies

Usage Examples
Selecting ISPs for DSL/Cable services for home users
Monitor accessibility of routers etc. from site
Long term and changes
Trouble shooting
Identifying problem reported is probably network related
Identify when it started and if still happening or fixed
Look for patterns:
Step functions
Periodic behavior, e.g. due to congestion
Multiple sites with simultaneous problems, e.g. common problem link/router …
Provide quantitative information to ISPs

Usage Examples

Current State – June ‘03 (throughput)
Within region performance better
E.g. Ca|US-NA, Hu-SE Eu, Eu-Eu, Jp-E Asia, Au-Au, Ru-Ru
Africa, Caucasus, Central & S. Asia all bad

Trends

Slide 13

Network Readiness
NRI from Center for International Development, Harvard U. http://www.cid.harvard.edu/cr/pdf/gitrr2002_ch02.pdf

Challenges
Effort:
Negligible for remote hosts
Monitoring host: < 1 day to install and configure, occasional updates to remote host tables and problem response
Archive host: 20% FTE, code stable, could do with upgrade, contact monitoring sites whose data is inaccessible
Analysis: your decision, usually for long term details download & use Excel
Trouble-shooting:
usually re-active, user reports, then look at PingER data
have played with automating alerts, data will/is available via web services
Ping blocking
Complete block easy to ID, then contact site to try and by-pass, can be frustrating for 3rd world
Partial blocks trickier, compare with synack
Derived throughputs poor for well connected sites (<0.1% loss)
Funding
“Unfortunately, network management research has historically been very under-funded, because it is difficult to get funding bodies to recognize this as legitimate networking research.”  Sally Floyd, IAB Concerns & Recommendations Regarding Internet Research & Evolution.
http://www.ietf.org/internet-drafts/draft-iab-research-funding-00.txt

Collaborations & Funding
35+ monitoring sites in 15 countries
Plan to add ICTP Trieste if funded
Other projects used toolkit, e.g. XIWT, PPCNG/EDG …
SLAC with help from FNAL
Digital Divide collaboration (MOU) with ICTP, Trieste
eJDS
They are looking for a EU grant for eJDS and PingER
Need funding for coming year:
Working with DoE, NSF, Pew Charitable Foundation …
Tasks:
(0.5 FTE) ongoing maintain data collection, explain needs, reopen connections, open firewall blocks, find replacement hosts, make limited special analyses, prepare & make presentations, respond to questions
(+ 0.5 FTE) extend the code for new environment (more countries, more data collections), fix known non-critical bugs, improve visualization, automate  reports generated by hand today, find new country site contacts, add route histories and visualization, automate alarms, update web site for better navigation, add more DD monitoring sites/countries, improve code portability
Also looking for small grants for helpers in developing countries
ICFA: show importance to policy makers, funding agencies, identify sympathetic contacts at agencies, get support
Ported to IPv6

Summary
Valuable light-weight tool for end-to-end performance
Good for trouble-shooting, planning, setting expectations
World wide coverage
Performance from U.S. is improving all over
Performance to developed countries are orders of magnitude better than to developing countries
Poorer regions 5-10 years behind
Poorest regions Africa, Caucasus, Central & S. Asia
Some regions are:
catching up (SE Europe, Russia),
keeping up (Latin America, Mid East, China),
falling further behind (e.g. India, Africa)

More Information
PingER:
www-iepm.slac.stanford.edu/pinger/
MonaLisa
monalisa.cacr.caltech.edu/
GGF/NMWG
www-didc.lbl.gov/NMWG/
ICFA/SCIC Network Monitoring report, Jan03
www.slac.stanford.edu/xorg/icfa/icfa-net-paper-dec02
Monitoring the Digital Divide, CHEP03 paper
arxiv.org/ftp/physics/papers/0305/0305016.pdf
Human Development Index
www.undp.org/hdr2003/pdf/hdr03_backmatter_2.pdf
Network Readiness Index
www.weforum.org/site/homepublic.nsf/Content/Initiatives+subhome

Extra Slides

Countries Monitored

Loss Comparisons with Development (UNDP)