
Macroscope: End-Point Approach to Networked Application Dependency Discovery

Lucian Popa, University of California

[email protected]

Byung-Gon Chun, Intel Labs Berkeley

[email protected]

Ion Stoica, University of California

[email protected]

Jaideep Chandrashekar, Intel Labs Berkeley

[email protected]

Nina Taft, Intel Labs

[email protected]

ABSTRACT

Enterprise and data center networks consist of a large number of complex networked applications and services that depend upon each other. For this reason, they are difficult to manage and diagnose. In this paper we propose Macroscope, a new approach to extracting the dependencies of networked applications automatically by combining application process information with network level packet traces. We evaluate Macroscope on traces collected at 52 laptops within a large enterprise and show that Macroscope is accurate in finding the dependencies of networked applications. We also show that Macroscope requires less human involvement and is significantly more accurate than state of the art approaches that use only packet traces. Using our rich profiles of the application-service dependencies, we explore and uncover some interesting characteristics about this relationship. Finally, we discuss several usage scenarios that can benefit from Macroscope.

Categories and Subject Descriptors

C.2.3 [Computer-Communication Networks]: Network Operations

General Terms

Design, Measurement

Keywords

Application dependencies, end-point tracing

1. INTRODUCTION

Today’s enterprise networks and datacenters are extremely complicated entities, running a large number of interdependent applications and services. A recent survey stated that companies such as Citigroup and HP run thousands of applications in their networks [2]. While this sheer number is remarkable, the actual complicating issue is the manner in which they are coupled to each other. For example, Microsoft Outlook, ostensibly an email client, depends on services such as AD (Active Directory), LDAP, DNS, and network proxies, among others. When users in the network cannot “see their email”, what should the network administrator look at? The key challenge facing these complicated deployments is management related. It is estimated that up to 70% of an enterprise’s IT cost goes toward maintenance and configuration costs [15], i.e., tasks such as fault localization, performance debugging, reconfiguration planning and dependency analysis.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. CoNEXT’09, December 1–4, 2009, Rome, Italy. Copyright 2009 ACM 978-1-60558-636-6/09/12 ...$10.00.

Discovering (or mapping) dependencies between applications and services has been recognized previously as an important task [5, 6, 9, 12]. In much of this work, the dependency discovery task has relied solely on network-level packet trace data, i.e., traces collected in the network. For example, Sherlock [5] uses the time correlation of packets between different services (in particular, the number of co-occurrences in some time interval), and Orion [9] uses the spikes of delay distribution of flow pairs. Using this data is advantageous in some ways; it can be collected quite easily, without any impact on the end-hosts. However, there are several limitations, described below, of using only this data to extract the application and service dependencies.

First, these techniques can be quite imprecise. Timing correlations between independent services can create the illusion of causality, leading to false positives. Also, when background traffic perturbs the timing of network packets, or if there is high temporal variation in how the packets are observed, false negatives can occur. Second, approaches based solely on packet traces cannot uncover dependencies when the communication patterns are very dynamic or infrequent; this is because the dependency extraction methods require a large number of samples to converge. Third, when only packet traces are used, some human intervention is required to extract the dependencies correctly. To state this differently, existing methods require some seed ground truth to extract the application dependencies, in order to connect application level information with the network level traces. This makes the accuracy of the extracted dependencies depend entirely on the human operator.

In this paper, we propose a new approach to extracting the dependencies of applications automatically. In contrast to previous methods, our approach involves a small amount of instrumentation at the end-hosts. Specifically, we collect information at the end-hosts about the particular applications that are generating network traffic and join this with network packet traces. Further, we correlate this information from multiple end-hosts to arrive at the application dependencies. We argue that this extra information, which basically serves to replace the human knowledge required to seed the dependencies (in previous methods), can be easily collected and is more reliable. For example, contemporary enterprise deployments already feature a host of security and monitoring features installed on end-hosts and it would be easy to add (lightweight) application tracing code to these systems.

We present Macroscope, a system that automatically extracts networked application dependencies using network-level and application-level information. Macroscope consists of three components: Tracers, a Collector, and an Analyzer. Tracers run on individual end-hosts and log application-level process information by periodically sampling TCP/UDP connection tables and joining this information with network level packet traces. This allows individual network flows to be associated with the generating application. These processed traces are then shipped to a centralized Collector. The co-located Analyzer correlates the traces across the population of end-hosts to finally extract the dependencies with various properties, as will be described in later sections.

The extracted dependencies are critical for a number of management related tasks: (1) they can aid in fault diagnosis by enumerating all the key services used by an application, which allows the network operators to focus on the relevant servers and services; (2) the dependencies allow network operators to understand the impact of any outages (planned or not) and to place resources to maximize utility; (3) learning the temporal dependencies of an application can help operators determine the correct order in which to restart services (many web services today are composed of a number of services, and ordering is critical); (4) knowing the correct dependencies for an application can aid with detecting anomalies in the application or service (some malware is known to replace existing executables with custom versions which make connections not normally associated with the application). There are more compelling usage scenarios, in which discovering and enumerating the application dependencies is a crucial step, than we can list here. However, we do revisit some of these in Section 5 and provide specifics of how the learned dependencies can be used.

We summarize the key contributions of this paper:

• We propose Macroscope, a novel method to identify the service-level dependencies that applications rely on. We define a notion of static dependencies and present a practical method for extracting them. Our method uses application level data in conjunction with network-level traces to build application-service dependency graphs at multiple levels of granularity.

• We evaluate our methodology with traces collected at 52 enterprise end-hosts. We show that Macroscope is accurate in detecting application dependencies and that it outperforms by a wide margin a state-of-the-art application dependency extraction system that uses only packet traces (Orion [9]). This quantifies the gains of bringing application level information into the problem of dependency inference.

• With our rich profiles of the application-service relationship, we carry out some exploratory assessment of over 150 applications. We uncover some interesting findings about these relations, such as an elephants and mice phenomenon for application-service dependencies, a surprisingly large number of single-service applications, and others. Finally, we describe several management applications that could exploit these properties for greater effectiveness.

The rest of this paper is organized as follows. Section 2 presents Macroscope’s approach. Section 3 presents Macroscope’s system architecture and the algorithms it implements. We evaluate Macroscope in Section 4, discuss a set of usage scenarios in Section 5 and related work in Section 6, and finally conclude.

2. MACROSCOPE APPROACH

2.1 Dependency Definitions

We use the term application, denoted (app), to mean an application executable (and assume the name is consistent across all the end-hosts). An application instance refers to the application being run on a particular end-host; we represent it by the tuple (app, PID, IP), where PID is the process identifier and IP is the machine network address. We break down the PID instances by the IP to increase the number of application instance samples. For example, a process active for multiple days can be seen as a new instance each new work day (e.g., when the user logs in to the application) and we approximate this behavior by using IPs. We ignore the reuse of the same PID as the process identifier space is quite large.

An application may use one or more services; a service is identified by the tuple (protocol, port). Common services include DNS, LDAP, HTTP, etc. A service instance is identified by the tuple (IP′, protocol, port), where IP′ represents the address of the machine on which it is running.

Next we define a dependency (or relation), which is an association between one application and one service. A naive approach would be to define the use of any service as a dependency (i.e., an established connection to the service from the application); however, this definition is overly broad. Ideally, we want dependencies to capture the “critical” services of an application, i.e., the services that are used frequently and whose reachability predicts the success or failure of the application. Knowing such dependencies can help to locate faults and to improve application performance (e.g., by replicating more of the critical and frequently used services). To approximate this set of desirable associations, we introduce the notion of a static dependency. Application app has a static dependency on service S if the usage of S can be predicted before executing app: (1) independently of user actions or other application instances and (2) using only the application’s source code and configuration files.¹

A static dependency is denoted as the relation (app) → (protocol, port). Relations of this type that do not represent static dependencies are called transient relations. To clarify, connections to ephemeral ports, or to ports that are dynamically generated, reflect transient relations. Compared to static dependencies, the services corresponding to transient relations are typically not actively managed by the enterprise and their use is limited to a few application instances. For these reasons, transient relations are less well-suited than static dependencies for detecting faults, planning enterprise resources and understanding application behavior.

By removing all connections pertaining to transient relations from the set of all the connections established by the application, we can arrive at the set of static dependencies. In this paper we develop simple heuristic methods to extract the (static) dependencies from packet traces and application context data.

¹ In the context of this paper, the services corresponding to static dependencies can be seen as actively managed by the enterprise (as we shall see in §4, applications cannot directly communicate outside our enterprise).

Figure 1: An example showing dependencies between applications and services. An oval represents an application or a service, and a rectangle represents a host that may run multiple applications or services.

Multilevel Dependencies: Static dependencies provide a high-level view of the interactions between applications and services. However, relations at a lower granularity are often useful. For this purpose, we define the following levels of granularity of the relation between applications and services:

1. app → (protocol, port); this captures the top level static dependencies.

2. app → (IP′, protocol, port); this specifies the individual service instances used by an application.

3. (app, PID, IP) → (protocol, port); this captures the general services used by a specific application instance.

4. (app, PID, IP) → (IP′, protocol, port); this finest level of granularity captures the service instance used by a particular application instance.
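The four levels above can be viewed as projections of a single connection record. The following is a minimal sketch under that reading; the `Connection` type, its field names, and the `project` helper are illustrative assumptions and are not part of Macroscope.

```python
from typing import NamedTuple


class Connection(NamedTuple):
    """One observed connection (hypothetical record layout)."""
    app: str       # application executable name
    pid: int       # process identifier on the end-host
    ip: str        # end-host address (IP)
    dst_ip: str    # address of the machine running the service (IP')
    protocol: str
    port: int


def project(conn: Connection, level: int):
    """Return the (source, destination) pair of a connection at level 1..4."""
    service = (conn.protocol, conn.port)
    service_instance = (conn.dst_ip, conn.protocol, conn.port)
    app_instance = (conn.app, conn.pid, conn.ip)
    if level == 1:
        return conn.app, service
    if level == 2:
        return conn.app, service_instance
    if level == 3:
        return app_instance, service
    if level == 4:
        return app_instance, service_instance
    raise ValueError("level must be between 1 and 4")
```

Collecting `project(c, 1)` over all connections of an application gives its candidate top-level relations, while level 4 keeps every application instance and service instance distinct.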

Figure 1 illustrates an example for two applications, each with two application instances, and three services, each with two instances. There are no transient connections in this example. Static dependencies are the connections between the applications and the services (first and last columns). The connections between the application instances in the second column and the service instances in the third column belong to level 4 above.

Conversely, the relation between applications and services can also be regarded from the perspective of services, such as the applications that use one (or several) service(s), i.e., the mapping (protocol, port) → app.

We point out that some static dependencies may actually hide numerous distinct services offered on the same port; these services can be offered by multiple hosts, which can belong to different enterprises (in this case the destination IPs could be viewed as "ephemeral"). The most striking example of this type is the (tcp, 80) HTTP service. Such static dependencies may be less effective for the administrative purposes of fault isolation and resource planning; to help administrators, Macroscope can present only the dependencies with intra-enterprise service instances. More generally, in the future, we plan to consider aggregations of destination IPs, such as in [11], in which particular destination IPs are grouped together to capture the service they represent. In an enterprise, an example would be “mail.intel.com”.

Dependency Profiling: In the dependency graph, each dependency can also have an associated weight, such as the usage frequency (the fraction of the application instances that use that dependency) or the traffic volume on that dependency. Depending on the usage scenario, we can use a different metric for the weights.

Finally, we can define a number of temporal relationships on the dependencies. In this paper, we only consider the Causal Order relation, defined as follows. Dependency A of application app is “causally ordered before” dependency B if A is often accessed before B by instances of app, and the accesses of A and B occur within a short time window W. Note that the remote-remote dependency definition used in Orion [9] is similar to the relation defined above. Unlike this definition, Orion does not consider the flow directionality; in Macroscope, dependencies are always flows initiated by the application to the service.
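One way to make the Causal Order relation concrete is sketched below. The source does not quantify "often", so the `min_fraction` parameter, its default value, and the choice to count only instances that access both dependencies are assumptions made for illustration.

```python
def causally_ordered_before(accesses, dep_a, dep_b, window, min_fraction=0.5):
    """Check whether dep_a is "causally ordered before" dep_b.

    accesses: dict mapping an application instance to a list of
              (timestamp, dependency) access events.
    window:   the time window W, in the same units as the timestamps.
    min_fraction: assumed meaning of "often" (not specified in the paper).
    """
    using_both = 0   # instances that accessed both dependencies
    supporting = 0   # instances where some access of A precedes B within W
    for events in accesses.values():
        times_a = [t for t, dep in events if dep == dep_a]
        times_b = [t for t, dep in events if dep == dep_b]
        if not times_a or not times_b:
            continue
        using_both += 1
        if any(0 < tb - ta <= window for ta in times_a for tb in times_b):
            supporting += 1
    return using_both > 0 and supporting / using_both >= min_fraction
```

Because only flows initiated by the application are considered, the events passed in would all be outgoing accesses, which matches the directionality remark above.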

2.2 End-point Tracing vs Network Tracing

Macroscope’s approach of recording application context at end-hosts has two important advantages over the approaches that use only in-network monitoring for this purpose: (1) it is fully automatic and (2) it is more accurate. The price we pay is the required instrumentation of end hosts and the overhead incurred upon them. However, although CPU and memory usage statistics were not collected along with the traces used in this paper, qualitatively no user noticed any slowdown in their machines.

Fully Automatic: By capturing the application names when sampling the connection table, Macroscope can correctly associate applications with the connections being generated (and consequently, accurately determine the application-service dependencies). In contrast, with approaches that rely solely on packet traces from the network, the application information is supplied by a human who seeds a small number of application-service dependencies (the dependencies are grown transitively). Interestingly, we show in Section 4 that many applications have only a single static dependency; in these cases, automation serves no purpose for the approaches using only packet traces.

Improved Accuracy: Macroscope detects more of the actual dependencies. Previous works such as Orion [9] and Sherlock [5] rely on relations between dependencies for their detection. In general, this approach can only detect a subgraph of the real dependency graph (i.e., only those dependencies that exhibit that particular relation can be detected); this detection is also quite noisy, since it relies on co-occurrences in a timed window. In contrast, the detection approach that we present in this paper captures dependencies of an application with high accuracy, regardless of their relations to other dependencies. This detection is more accurate even for detecting dependencies with relations between them, because we can identify which connections belong to the same application.

Macroscope also reduces the rate of false positives (i.e., fake dependencies). For example, Orion detects application dependencies by hand picking a set of known (seed) dependencies, and finds other services that are orderly dependent on these seeds (Orion uses a single seed dependency). It is easy to see how this type of detection can go wrong: assume application app has dependencies A and B ordered, and application app’ has dependencies X and B ordered; hand picking service B as a seed dependency for app may lead to the wrong conclusion that app is also using X. Also, some applications have no “specific” service dependency; for example, the Windows executable searchprotocolhost.exe accesses only well known services (Active Directory, EPMAP, DNS and Web). This situation is almost guaranteed to lead to false positives (and potentially false negatives) since these services are used by multiple applications with other dependencies.

Figure 2: Macroscope system architecture. (A Tracer runs on each of Host 1 through Host m; the Collector and Analyzer run at a central site.)

3. MACROSCOPE

We now describe the architecture, dependency extraction algorithms, and prototype implementation of Macroscope.

3.1 System Architecture

Figure 2 shows the overall architecture of Macroscope, which consists of Tracer, Collector, and Analyzer.

Tracer: The Tracer runs on end-hosts and has two components: one that periodically samples and logs the OS connection table and one that captures and logs packets. We have implemented the Tracer for enterprise specific laptops running Microsoft Windows XP.

The Tracer samples the TCP and UDP tables and appends this information to a file. Each sample contains the time of sampling, application-level information such as the application name and process ID, and the connection tuples (protocol, ports). The Tracer samples every five seconds using the GetExtendedTcpTable and GetExtendedUdpTable calls. For UDP, the remote endpoint is detected from the packet traces. The Tracer stores the data in XML files.

The Tracer also captures packet level traces using WinPcap [3]. We collected the packets on the end-hosts because the data collection effort supported multiple research projects, some of which needed host mobility information. We point out that there are other ways to implement the packet tracing component of Macroscope. If the goal is to focus on dependencies that occur for applications used in the enterprise setting, then packet traces could alternatively be captured by network monitors located inside a corporate network. Moreover, Macroscope does not require a strict time synchronization between the in-network capturing device and the end hosts, because the reutilization of ports opened by clients is done on a very large time scale (e.g., hours) and these are actually used to synchronize the packet traces with the connection table samples.

Collector: The Collector aggregates data received from multiple hosts and joins the application-level data with the network level data. It first constructs flows (TCP/UDP sessions) from the packet traces using Bro [1]. It then combines the data in the XML connection table samples with the flow data by joining connection tuples. This data is used for analysis.
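The join performed by the Collector can be illustrated with a small sketch. The dictionary field names below are hypothetical, and keying the join on the local endpoint (protocol, local IP, local port) is a simplifying assumption chosen because, as noted above, the remote endpoint of UDP samples comes from the packet traces; the actual Collector may join on more of the connection tuple and also take timestamps into account.

```python
def join_samples_with_flows(samples, flows):
    """Attach (app, pid) to each flow whose local endpoint was sampled.

    samples: connection-table samples, dicts with keys
             protocol, local_ip, local_port, app, pid (assumed layout).
    flows:   flow records, dicts with keys
             protocol, src_ip, src_port, dst_ip, dst_port (assumed layout).
    """
    owner = {}
    for s in samples:
        key = (s["protocol"], s["local_ip"], s["local_port"])
        # Keep the first process seen on this endpoint; client ports are
        # reused only on a long time scale, per the discussion above.
        owner.setdefault(key, (s["app"], s["pid"]))

    joined = []
    for f in flows:
        key = (f["protocol"], f["src_ip"], f["src_port"])
        if key in owner:
            app, pid = owner[key]
            joined.append({**f, "app": app, "pid": pid})
    return joined
```

Flows with no matching sample (for example, flows shorter than the sampling period) are dropped by this sketch rather than guessed at.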

Analyzer: The Analyzer consists of a set of tools (implemented in Python) for extracting dependencies, querying dependencies and producing graphical visualizations of the dependencies. These are detailed in the next section. Macroscope can visualize the static dependencies at each of the levels of granularity defined in Sec. 2.1, for a single application or for a group of applications. In addition, we can produce visualizations of the dependencies weighted by their usage frequency or traffic volumes, and depict the temporal evolution of dependencies.

Caveats of sampling: Our design that samples application-level data at end hosts has interesting tradeoffs. By increasing sampling frequencies, we can obtain more accurate snapshots of application-level data but this incurs higher resource overhead at end hosts. With the five-second sampling period, the overhead is negligible, but this sampling approach has some caveats.

First, the sampling can miss very short-lived flows. To compensate for the loss of accuracy, we use simple statistical tools as shown in the next section. Second, the connection sampling may hide information about the actual application. An application can use fork such that the same application appears as having different types of dependencies. Plug-ins (e.g., browser plug-ins) may also make application dependencies vary from one instance to another. Also, some connections could be delegated by applications to local OS daemons acting as proxies (e.g., some applications delegate DNS queries to the lsass service).

Despite these caveats, the evaluation presented in Section 4 shows that Macroscope achieves good performance for detecting application dependencies.

3.2 Algorithm to Identify Static Dependencies

We now describe our algorithm for identifying the static dependencies of an application. The inputs to this algorithm are all the (sampled) connections outgoing from the application (the 5-tuple connection information on IPs, ports, and protocol), along with the process ID (PID) and the user identifier of the application instance that initiated the connection.

We use a two-step algorithm. In the first step we classify applications into one of two types: an application either generates only connections produced from static dependencies, or it generates connections belonging to both static dependencies and transient relations. Applications that use transient connections are those that use ephemeral ports and/or peering connections such as in P2P applications. It turns out that we do have a number of applications with no, or very few, transient connections. We separate our applications into these two classes for the following reason. Those applications that generate connections from both static dependencies and transient relations will require a second processing step in which the transient and static components are further separated. However, those falling into the class that has only static dependencies need no further processing in terms of identifying the static dependencies because all of the dependencies in this class are static.

The second step of our algorithm takes the data from the applications with connections arising from both static dependencies and transient relations, and tries to distinguish them. To do this, we make use of usage frequency information (defined below). A central idea is that we assume that static dependencies are common across instances of the same application, whereas transient connections are not. We believe that our traces are large enough (in number of application instances and users) so as to have enough samples to differentiate these cases quite accurately. For example, static dependencies use the same port every time while transient connections typically use ephemeral ports. Frequently ephemeral ports are randomly assigned (although not always) and thus their usage frequency can be as low as 1/(#ephemeral ports) (the number of ephemeral ports can be as high as 60,000), whereas the port usage frequency for static dependencies will typically be much higher.

Classifying applications: To separate our applications into the two classes described above, we use the following simple approximation method. Let $N^a$ be the number of instances of application $a$ in the traces, which we compute by counting the number of distinct (PID, IP) tuples for $a$. $N^a_s$ denotes the number of instances of $a$ that used service $s$, where $s$ refers to a specific (port, protocol) pair. Let $S^a$ denote the total number of services, or (port, protocol) pairs, observed across all instances of application $a$ in all the traces. We now define $V^a_s = N^a - N^a_s$ to capture the number of application instances that did not use service $s$. We define the metric $M^a$ to be

$$M^a = \sqrt{\frac{\sum_s (V^a_s)^2}{S^a}}.$$

This metric can be interpreted as follows. When all application instances use all services, then $M^a = 0$. The larger $M^a$ is, the farther away the application is from being one with purely static dependencies, i.e., it is more and more dominated by transient connections. The maximum value, $M^a = N^a - 1$, occurs when each service is used only once by one of the application instances. Note that $M^a$ rapidly increases when transient connections are used. For example, suppose each instance of an application $a$ uses the same 10 static dependencies and has only 1 transient connection. Then each of the transient connections will have its own entry $V^a_s$, and the value of $M^a$ will easily explode for traces with many application instances.

We compute Ma for each application. We classify an applica-tion as having only static connections, if Ma lies below a thresholddefined as follows. Since Ma ranges from 0 to Na−1, we say thatif Ma < T (̇Na − 1) where T is a percentage, then the applicationhas no transient relations. For the value of T we use 85%. In Sec-tion 4 we show that our classifying heuristic is somewhat resilientto the value of the threshold T , and thus we need not worry aboutfine tuning its value. Clearly this heuristic does not mandate thatthe number of transient connections for such applications needs tobe exactly zero. Our heuristic may still classify an application ashaving only static dependencies even if a small number of transientconnections are made or if these are used by a lot of the applicationinstances. Labeling transient relations as static dependencies willresult in false positives for our method. But as we will see in Sec-tion 4, Macroscope has very few false positives and there are quitea number of applications with zero transient relations.
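As an illustration of the classification step, the sketch below computes the metric (written `m_a` for application a) from per-instance service usage and applies the threshold test. The function names and the input layout are assumptions made for this example; only the formula and the 85% threshold come from the text.

```python
import math


def transience_metric(usage):
    """Return (M, N) for one application.

    usage: dict mapping each application instance, e.g. (pid, ip),
           to the set of services (protocol, port) it used.
    """
    n_a = len(usage)                          # N: number of instances
    services = set().union(*usage.values())   # all services seen; S = len
    if not services:
        return 0.0, n_a
    total = 0
    for s in services:
        n_as = sum(1 for used in usage.values() if s in used)
        total += (n_a - n_as) ** 2            # V_s squared
    return math.sqrt(total / len(services)), n_a


def has_only_static_dependencies(usage, t=0.85):
    """Apply the test M < T * (N - 1) with T = 85% by default.

    With a single instance the bound is 0 and the strict test fails;
    the text does not say how that degenerate case is handled.
    """
    m_a, n_a = transience_metric(usage)
    return m_a < t * (n_a - 1)
```

For the example in the text (10 shared static dependencies plus one distinct transient connection per instance), 100 instances give a metric of about 94.4 against a bound of 0.85 · 99 ≈ 84.2, so the application is placed in the class that needs the second step.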

There are a variety of heuristics that could be defined to carry out this step (e.g., set a threshold for the maximum number of static dependencies, so that an application with more distinct service usage is labeled as having transient connections); however, the one here is intuitive, simple and works well.

Identify Static Dependencies: Having identified those applications that contain essentially only static dependencies, we can remove them from further dependency analysis. We now focus only on the applications that generate connections due to both static dependencies and transient relations. We extract the static dependencies using the following method. First, we consider all services listening on ports below 1024 as static dependencies (regardless of their usage frequency). This is because these ports are not typically used as ephemeral ports and thus there is a high likelihood that these represent known maintained services. Second, any remaining services, i.e., (port, protocol) tuples, are labeled as static if they satisfy two criteria on usage frequency.

Let $U_{all}$ denote the set of all users, and let $U^a \subset U_{all}$ be the users that employ application $a$. $U^a_s \subset U^a$ represents the users that have connected to service $s$ through application $a$. We label application $a$ as having a static dependency on service $s$ if both of the following hold:

$$\frac{|U^a_s|}{|U^a|} \ge U \quad \text{and} \quad \frac{N^a_s}{N^a} \ge I$$

$U$ and $I$ are two thresholds. In our implementation, we picked 10% as the relative threshold for these two ratios. This value was derived experimentally and works quite well for our data. However, we also show in Section 4 that the method is not very sensitive to this parameter.
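The second step can be sketched as follows. The record layout (user, instance, service) and the function name are assumptions for illustration; the port rule and the two 10% thresholds are taken from the text.

```python
def static_dependencies(records, user_threshold=0.10, instance_threshold=0.10):
    """Label the static dependencies of one application.

    records: iterable of (user, instance, service) observations, where
             instance is e.g. (pid, ip) and service is (protocol, port).
    """
    records = list(records)
    users = {u for u, _, _ in records}          # users of the application
    instances = {i for _, i, _ in records}      # instances of the application
    users_by_service = {}
    instances_by_service = {}
    for user, inst, svc in records:
        users_by_service.setdefault(svc, set()).add(user)
        instances_by_service.setdefault(svc, set()).add(inst)

    static = set()
    for svc in users_by_service:
        _, port = svc
        if port < 1024:
            # Ports below 1024 are treated as static regardless of frequency.
            static.add(svc)
            continue
        user_ratio = len(users_by_service[svc]) / len(users)
        instance_ratio = len(instances_by_service[svc]) / len(instances)
        if user_ratio >= user_threshold and instance_ratio >= instance_threshold:
            static.add(svc)
    return static
```

A service reached through a randomly chosen ephemeral port typically fails both ratios, whereas a high-numbered but fixed port used by most instances passes them.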

We use both of these criteria for the following reasons. If we considered application instances only, then a single user that frequently used a particular application could bias the ratio $N^a_s / N^a$ by generating a huge number of instances that use a particular service. For example, such a user could contact a particular peer often, and that peer can spuriously appear as a dependency (false positive) if it is using the same ephemeral port to provide peering service. To avoid this, we require that a minimum fraction of users use the application and the service as well. We elect not to employ user frequency only since this detection can be inaccurate with long traces. When using long traces, the same transient connection can occur in multiple traces (since there is a finite number of ephemeral ports), which would also result in false positives.

Adjustments due to sampling: Macroscope’s method of periodically sampling the connection table can miss very short connections. Thus, the usage frequency of short connections will be affected, possibly leading to an undesired bias towards mislabeling short connections as transient connections. For this reason, we use an updated value for the usage frequency of a service based on its detection probability (i.e., probability of being sampled by the Tracer in the connection table). The detection probability is dictated by the length of the connection to the service. If the average connection duration is $d$ and Macroscope’s connection table sampling period is $W > d$, we define the detection probability $p_d = d/W$. For those connections shorter than $W$, we boost the usage frequency (ratio of instances using a service) by $1/p_d$. We only apply this adjustment if multiple samples of the service usage are available, to avoid spurious boosts. This approach may slightly over-boost when dependencies are regularly used multiple times by each instance, but this is acceptable since such dependencies are likely to be static ones.
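A minimal sketch of this adjustment follows. The function name and arguments are illustrative, and reading "multiple samples" as at least two observed durations is an assumption; the source does not state whether the boosted value is capped, so no cap is applied here.

```python
def adjusted_usage_frequency(frequency, durations, sampling_period=5.0):
    """Boost a service's usage frequency by 1/p_d, where p_d = d / W.

    frequency:       measured ratio of instances using the service.
    durations:       observed connection durations to the service (seconds).
    sampling_period: W, the connection table sampling period (seconds).
    """
    if len(durations) < 2:
        # Assumed reading of "multiple samples": skip the boost otherwise.
        return frequency
    d = sum(durations) / len(durations)   # average connection duration
    if d <= 0 or d >= sampling_period:
        return frequency                  # only connections shorter than W
    p_d = d / sampling_period             # detection probability
    return frequency / p_d
```

For instance, a service whose connections last one second on average is sampled with probability 1/5 under the five-second period, so a measured frequency of 4% is adjusted to about 20%.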

4. EVALUATION

The data traces used in this paper were captured on 52 employee laptops (assigned to individual owners) in a large enterprise over a period of 11 weeks. All the instrumented systems ran a standard corporate build of Windows XP SP2. Each of these systems had a suite of enterprise applications (security and management related) installed on them. Users are free to install additional software on their systems with the exception of P2P applications, which are banned (except Skype). Data collection on the laptops proceeded even as the users moved between home and work. Accordingly, each trace file corresponds to one of three modes: (i) inside the corporate network, (ii) outside, but connected to the VPN and (iii) outside and not on the VPN. Since we are chiefly interested in sce-

233

LDAP AD (389)Web (80)Web (80)

Active Directory (1025)Proxy Discovery (9090)

Communicator exe

SIP over TLS (5061)

EPMAP (135) HTTPS (443)

Proxy Service (911)

Communicator.exe

EPMAP (135) HTTPS (443)

(a)

Bi Fi i (63422) EPMAP (135)BigFix service(63422) EPMAP (135)

BESClient exe

47.4% 2.8%

Active Directory (1025)

BESClient.exe

LDAP AD (389)

44.9% 1.7%

AD logon and directory replication(1026)

3.1%

(b)besclient.exe 10.3.22.142

Active directory(1025) 143.183.2.91 EPMAP(135) 143.183.2.91 EPMAP(135) 10.3.86.40 BigFix service(63422) 143.183.2.139 BigFix service(63422) 143.183.2.152 BigFix service(63422) 143.183.2.38BigFix service(63422) 143.183.2.145 BigFix service(63422) 143.183.2.68 BigFix service(63422) 143.183.2.141

besclient.exe 10.3.22.249 besclient.exe 143.183.163.255besclient.exe 10.3.23.61

LDAP AD(389) 143.183.2.91

besclient.exe 10.3.21.217 besclient.exe 143.183.167.95 besclient.exe 143.183.166.116

AD logon and directory replication(1026) 10.3.86.40

besclient.exe 143.183.247.208 besclient.exe 10.3.21.15

EPMAP BigFix AD logonLDAP AD

(c)

outlook.exe 10.3.23.61

tcp 1343 10.19.19.12 tcp 1339 10.19.19.12 tcp 1346 10.22.226.45tcp 1439 10.19.19.12 Web(80) tcp 1331 10.22.226.45 tcp 1380 10.22.226.45 Active directory(1025)tcp 1199 10.2.167.99MS Global Catalog or LDAP Queries(3268) 143.183.2.91 tcp 1377 10.22.226.45 tcp 1300 10.22.226.208tcp 1481 10.19.19.12 Intel Proxy Discovery(9090) EPMAP(135)tcp 1371 10.19.19.12tcp 1506 10.19.19.12

outlook.exe 10.3.21.217

tcp 1317 10.22.226.208tcp 1317 10.19.19.12 tcp 56165 10.2.167.99tcp 1299 10.19.9.33

outlook.exe 143.183.167.95

tcp 5574 10.22.226.208 tcp 1359 10.22.226.208 tcp 1428 10.22.226.208 tcp 1302 10.22.226.208 tcp 1265 10.22.226.208 tcp 1369 10.22.226.40tcp 1318 10.22.226.208 tcp 3375 10.22.226.208 tcp 9198 10.22.226.208 tcp 1329 10.22.226.45tcp 1324 132.233.42.1 tcp 1295 10.22.226.208

outlook.exe 143.183.163.255

tcp 1335 132.233.42.1 tcp 1307 132.233.42.1tcp 25078 10.19.19.12 tcp 1323 132.233.42.1tcp 1196 10.22.226.208 tcp 1305 10.19.19.12

outlook.exe 143.183.166.116

tcp 4846 10.22.226.207 tcp 1313 10.19.19.12 Intel Proxy Service(911)

outlook.exe 10.3.22.249outlook.exe 10.3.22.142 outlook.exe 143.183.247.208 outlook.exe 143.183.249.93

Web ProxyEPMAPProxy Discovery AD

(d)

Figure 3: Macroscope Sample Charts: (a) Static dependencies for MS Office Communicator, (b) Static dependencies for BigFixBESClient (weighted by % of outgoing traffic volume), (c) Instance level dependencies for BigFix BESClient (top application in-stances, bottom service instances), (d) Distinct port connections for MS Outlook (top application instances, bottom services); redlines denote static dependencies.

narios that aid the IT operators inside the enterprise, we only workwith the (subset of) data collected when the user is inside the cor-porate network. 2

In this section, we first describe the capabilities that Macroscope enables and present some sample visualizations (§4.1). We then evaluate the accuracy of our dependency extraction methodology and compare it to an established method (§4.2). Finally, we highlight a few interesting results that are enabled by Macroscope (§4.3).

4.1 Macroscope Capabilities

Multi-level Dependencies: Macroscope can discover all the levels of granularity of the application-service relations defined in Section 2.1. Filters can be applied to restrict these relations to a subset of the services. For instance, we can restrict the displayed services to only those that are static dependencies, use a subset of the data (in duration or number of users), or show dependencies based on time of day. Further, Macroscope can present these relations for a single application or for a set of applications.

As an example, Fig. 3(a) presents a visualization of the static dependencies for MS Communicator as uncovered by Macroscope. This chart was generated automatically, with the application name provided as input. In the figure, (tcp, 911) represents an HTTP proxy service used to communicate with hosts outside the enterprise, while the service (tcp, 9090) is a proxy discovery service that hosts use to locate the nearest proxy server.

At a different level of granularity (level 4 in §2.1), Fig. 3(c) is a visualization of all the service instances used by instances of the BigFix client application; upper vertices correspond to application instances (PID, IP) and the lower vertices represent service instances (IP′, protocol, port). In this figure, we see replicated services corresponding to the static dependencies (no transient connections were observed); we also see that most application instances have a regular pattern of using most static services and load balancing between them. In contrast, the access pattern in Fig. 3(d) is very different. Here, the black lines indicate transient connections, while red lines capture the static dependencies. Note that (i) there are few static dependencies relative to the transient destinations, and (ii) access patterns vary a lot across instances. There is an intuitive reason to expect a difference in access patterns between Figures 3(c) and 3(d): BigFix is an enterprise application that runs without any user involvement, while Outlook is almost completely user driven.

² Labeling the traces as inside/outside/VPN was done at collection time, by querying a host-resident proxy configuration service and by considering the network interface being used. In the absence of such support, this labeling could be done using knowledge of the enterprise IP address range and other information (e.g., in our case, all connections outside the enterprise are made through a proxy).

Dependency Profiling: Macroscope’s procedure of joining information creates rich data that enables the creation of profiles for applications and their dependencies. In particular, Macroscope can label static dependencies by their usage frequency and traffic volume (incoming/outgoing), and can also find (temporal) relationships between static dependencies, such as the causal ordering introduced in Section 2.

Fig. 3(b) shows the static dependencies for the BigFix BESClient application, labeled by the percentage of outgoing traffic each dependency contributed relative to the total outgoing traffic of the application. As one can see, the outgoing traffic pattern of this application is dominated by the BigFix service (its main purpose) and by the traffic to the Active Directory service. Macroscope can present this type of data for a selected subset of the data (e.g., a period in time, number of users) and also show the variation of the weights over time. Finally, Macroscope can also show how much of the total incoming/outgoing traffic each application is responsible for and how many rejected flows each application generates.


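| Application | True Static Dependencies | Macroscope tp | Macroscope fp | Macroscope fn | Orion Best fn: tp* | Orion Best fn: fp | Orion Best fn: fn | Orion Best fp: tp* | Orion Best fp: fp | Orion Best fp: fn |
|---|---|---|---|---|---|---|---|---|---|---|
| Communicator | 8 | 8 | 0 | 0 | 8 | 17 | 0 | 4 | 3 | 4 |
| PWconsole | 8 | 7 | 0 | 1 | 6 | 19 | 2 | 1 | 0 | 7 |
| FrameworkService | 6 | 6 | 0 | 0 | 5 | 20 | 1 | 1 | 0 | 5 |
| Outlook | 8 | 8 | 10 | 0 | 8 | 11 | 0 | 8 | 11 | 0 |
| BESClient | 5 | 5 | 0 | 0 | 5 | 19 | 0 | 4 | 6 | 1 |
| Winword | 8 | 6 | 0 | 2 | 8 | 16 | 0 | 4 | 1 | 4 |
| lsass | 7 | 7 | 0 | 0 | 7 | 40 | 0 | 6 | 4 | 1 |
| SearchProtocolHost | 4 | 4 | 0 | 0 | 4 | 21 | 0 | 4 | 21 | 0 |
| Skype | 3 | 3 | 0 | 0 | 2 | 22 | 1 | 1 | 1 | 2 |
| UNS | 1 | 1 | 0 | 0 | X† | X | X | X† | X | X |
| userinit | 1 | 1 | 0 | 0 | X† | X | X | X† | X | X |
| conf | 2 | 2 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 |

Macroscope uses end-point and packet traces; Orion uses packet traces only.
∗ One of these true dependencies is manually added by the user.
† Since these applications have a single static dependency, this approach is not feasible.

Table 2: Accuracy of Static Dependency Detection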

| Executable | App. Name/Description |
|---|---|
| Communicator.exe | MS Office Communicator |
| PWConsole.exe | MS Office Live Meeting |
| FrameworkService.exe | McAfee VirusScan Framework Service: component of an antivirus product |
| Outlook.exe | MS Outlook |
| BESClient.exe | BigFix BESClient: remote administration software |
| WinWord.exe | MS Office Word |
| lsass.exe | MS Local Security Authority Subsystem Service: enforces Windows security |
| SearchProtocolHost.exe | MS Windows Search Protocol Host: used for desktop and network search |
| Skype.exe | Skype |
| UNS.exe | Intel Active Management Technology notifier service client |
| userinit.exe | MS Windows Userinit Logon: sets up network connections and shell |
| conf.exe | A process belonging to MS NetMeeting |

Table 1: Applications used to evaluate Macroscope’s accuracy in detecting static dependencies

4.2 Accuracy of static dependency detection

To evaluate Macroscope’s accuracy in determining static dependencies, we choose 12 applications for which we compare Macroscope’s output against (1) manually verified (ground truth) static dependencies and (2) the results returned by Orion [9], the state of the art method for extracting dependencies by only looking at packet traces. The 12 applications are listed in Table 1. We chose these applications based on their popularity and our interests. They represent a balanced mix of applications that use transient connections (7) and applications without transient connections (5).

In an ideal scenario, ground truth static dependencies would have to be derived from a careful analysis of the application source code, configuration files, and even the network itself. However, this is impractical since we do not have access to the source code. In this paper, we instead rely on operator knowledge and common sense to extract the confirmed static dependencies in the data, proceeding as follows. For every network flow observed in the traces and tagged with an application name: (i) we check the destination server and port against a list of known servers and ports used inside the enterprise (these are all the infrastructure servers formally managed by the IT department); if we have a match, the connection corresponds to a static dependency; else (ii) we attempt to manually connect to the server and port (believing that infrastructure servers do not go away in a period of a few months) and label the dependency as static if the connection succeeds; if that does not work, as a last resort (iii) we ask the IT operators in our enterprise to verify whether they believe the service to be persistent. If none of these methods classifies the dependency as static, we assume it is transient. Note that this is a conservative approach; IT operators cannot keep track of the thousands of services that are deployed, and their answers are based on a best guess.
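The three-step labeling procedure above can be summarized in a short Python sketch. It is only illustrative: in the paper, steps (ii) and (iii) were carried out manually, and the names `tcp_probe`, `label_dependency` and the `operator_confirms` callback are hypothetical.

```python
import socket
from typing import Callable, Set, Tuple

Service = Tuple[str, int]  # (server address, TCP port)


def tcp_probe(service: Service, timeout_s: float = 3.0) -> bool:
    """Step (ii): return True if a TCP connection to the service succeeds."""
    try:
        with socket.create_connection(service, timeout=timeout_s):
            return True
    except OSError:
        return False


def label_dependency(service: Service,
                     known_services: Set[Service],
                     probe: Callable[[Service], bool] = tcp_probe,
                     operator_confirms: Callable[[Service], bool] = lambda s: False) -> str:
    """Label a (server, port) pair as 'static' or 'transient'."""
    # (i) match against the list of infrastructure servers managed by IT
    if service in known_services:
        return "static"
    # (ii) the server still accepts connections on that port
    if probe(service):
        return "static"
    # (iii) last resort: ask the operators (stand-in callback)
    if operator_confirms(service):
        return "static"
    # none of the checks succeeded: conservatively assume transient
    return "transient"
```

Passing the probe and the operator check as callables keeps the ordering of the three checks explicit and lets the procedure be exercised without network access.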

Our described approach may miss some short-lived static dependencies due to sampling. Given that our traces were collected over a relatively long period and that static dependencies should be observed frequently, we believe that the missing dependencies are very few in number. For example, a dependency whose connections last 25 ms needs approximately 200 occurrences, in expectation, to be sampled once; even assuming only one usage per application instance, this requires 200 application instances, and many of the applications that we profile have far more instances than that.
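The figure of 200 occurrences is consistent with the detection probability p_d = d/W defined in Section 3, if one assumes a sampling period of W = 5 s. This value of W is an assumption made here for the worked example; it is the smallest sampling period listed in Table 3.

```latex
p_d = \frac{d}{W} = \frac{0.025\,\mathrm{s}}{5\,\mathrm{s}} = 0.005,
\qquad
\text{expected occurrences per sample} \approx \frac{1}{p_d} = 200 .
```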

Another artifact of the data collection process has to do with the non-transparent proxy required to contact external destinations. All connections to the outside are made through a proxy, which has the effect of collapsing all the static dependencies on outside destinations onto the proxy nodes.

Finally, while we expect the DNS service to be a static dependency for most applications, we are unable to attribute DNS connections correctly to some applications. This is because the Windows kernel delegates DNS lookups to a third party (lsass.exe). Thus, in almost all cases, the static dependencies of an application do not include DNS.

To compare Macroscope’s accuracy in detecting static dependencies to that of approaches based on packet traces only, we have implemented Orion [9], the current state of the art approach in this space. This implementation represents our best effort based on the description in the paper and discussions with the authors. We use Bro [1] to generate flow information from the packet traces (but we break flows using keepalive messages as in [9]).

Orion (along with Sherlock [5] and eXpose [12]) infers relationships between services based on temporal co-occurrence. Application context is not recorded; instead, applications are manually associated (seeded) with a set of dependent services. For example, suppose the services (tcp, 53), (tcp, 80) and (tcp, 443) are related based on co-occurrence. One can supply the knowledge that (tcp, 443) is a dependency of the web browser traffic. With this seed, the other two services (related to the seed) also become associated with the web browser application.

Figure 4: Convergence of dependency finding vs. number of traces. Panels: (a) Communicator, (b) Outlook, (c) FrameworkService, (d) PWConsole. The x-axis is the number of users (0 to 50); the plotted series are distinct-port connections, real static dependencies, and Macroscope static dependencies.

Figure 5: Sensitivity of static dependency detection. First three charts: number of static dependencies identified by Macroscope for different thresholds of I (instances) and U (users) for Microsoft Communicator, with (a) I = U, (b) U = 10%, and (c) I = 10%. Last chart (d): accuracy of detecting the applications with transient connections, as a function of the threshold T.

Main Results: Table 2 summarizes the evaluation of Macroscope’s accuracy in detecting static dependencies by comparison with the ground truth static dependencies and with Orion’s results. In Table 2, true positives (tp) are the actual static dependencies detected by the algorithms. False positives (fp) are dependencies identified as static (by Macroscope or Orion) that are not static dependencies in reality. False negatives (fn) are static dependencies that are not detected as such (are “missed”) by the applied algorithms.
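The three counts defined above are plain set operations on the detected and ground-truth dependencies. The following Python sketch shows the computation; the function names and the example services are hypothetical and are not taken from the evaluation data.

```python
from typing import Dict, Set, Tuple

Service = Tuple[str, int]  # (protocol, port)


def score(detected: Set[Service], truth: Set[Service]) -> Dict[str, int]:
    """Count true positives, false positives and false negatives."""
    return {
        "tp": len(detected & truth),   # real static dependencies that were found
        "fp": len(detected - truth),   # reported as static but not in the ground truth
        "fn": len(truth - detected),   # real static dependencies that were missed
    }


def relative_to_truth(counts: Dict[str, int], num_true: int) -> Dict[str, float]:
    """Express each count relative to the number of actual static dependencies."""
    return {name: value / num_true for name, value in counts.items()}


truth = {("tcp", 80), ("tcp", 443), ("tcp", 911)}
detected = {("tcp", 80), ("tcp", 911), ("tcp", 1343)}
counts = score(detected, truth)
print(counts)  # {'tp': 2, 'fp': 1, 'fn': 1}
```

Normalizing by the number of actual static dependencies explains why a false positive ratio can exceed 100%.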

For each application, we use two different seeds for Orion: one that minimizes the number of false positives and one that minimizes the number of false negatives. The number of true positives shown in Table 2 for Orion includes the manually specified dependency. To find the best seeds, we inspected the results obtained when each of the static dependencies was picked as the seed. For clarity, in Table 2 we use a single entry for the services that are offered on both UDP and TCP (DNS, LDAP). In this case, we count one false positive if the service is spuriously detected as a dependency on at least one of the two transport protocols, and count one false negative if it is detected on neither. We aggregate the Orion delay distribution bins to reflect the dependencies used in this paper and improve the Orion data set [9].

Table 2 shows that Macroscope has very few false positives or false negatives: it detects over 95% of the static dependencies and has a low number of false positives, amounting to about 18% of the number of static dependencies. Macroscope also offers a significant improvement compared to Orion. With the best seed pick for false negatives, Orion detects about 91% of the dependencies (the manual seeds were counted as actually detected) but has a high false positive ratio, over 315% of the number of actual static dependencies. With the best seed pick for false positives, Orion has fewer false positives, amounting to about 80% of the number of static dependencies, but detects only about 57% of the static dependencies. Any other seed results in worse performance in at least one of the two metrics.

It is expected that Macroscope would be superior to approaches that only use packet traces, because it has more information at its disposal. Here we quantify that gain. The results show that Macroscope is able to maintain both low false positives and low false negatives, whereas Orion in this case can attain at most one of these goals.

Macroscope’s false positives are due to mislabeling by its algorithm. As shown, for most applications Macroscope has at most one false positive. Outlook stands out in the table because of its large number of false positives. We discovered that the connections to the mail server (all mail in the enterprise is delivered over an MS Exchange connection) tended to reuse ephemeral ports and were long lived; some of these transient ports are used by up to 18 users (34% of all users) and by up to 15% of the instances, well above the usage frequency of many static services in general. Macroscope’s false negatives can occur for two reasons: (1) no connection to the service was ever sampled, or (2) Macroscope’s algorithm misclassified the service due to infrequent usage; the latter is the cause of the false negatives in Table 2.


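| Thresholds | Metric | Sample period 5s | 10s | 20s | 30s |
|---|---|---|---|---|---|
| I=U=10% | tp | 95% | 85% | 84% | 82% |
| I=U=10% | fp | 18% | 16% | 16% | 16% |
| I=U=5% | tp | 95% | 92% | 87% | 85% |
| I=U=5% | fp | 52% | 46% | 83% | 82% |

Table 3: Effect of sampling rate and filtering thresholds.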

Sensitivity Analysis: We now evaluate how the detection of static dependencies is affected by the number of traces, by Macroscope’s parameters (§3.2), and by the sampling rate.

Figure 4 presents the number of static dependencies of four applications as extracted from an increasing subset of user traces (a user is represented by an individual laptop). The figure shows that Macroscope converges towards the actual number of static dependencies as more data becomes available.

Fig. 5 shows the number of identified static dependencies for MS Office Communicator, given different values of the instance (I) and user (U) relative thresholds (parameters described in Section 3.2). Both I and U are varied in the range 0%–50%. Fig. 5(a) shows the number of identified static dependencies when I = U; in Fig. 5(b), U is set to 10% and only I is varied; and in Fig. 5(c), I is set to 10% and U is varied. As one can see, there is a knee in the curve: above that selection threshold, the number of identified static dependencies decreases very slowly (it remains almost constant), although raising the threshold further may result in a few false negatives. Our goal is to set the threshold at the knee of this curve.
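To make the role of the two thresholds concrete, the following Python sketch selects the services whose instance ratio and user ratio both reach their thresholds. The exact selection rule is given in Section 3.2; the conjunction of the two conditions, the use of `>=`, and all names and numbers below are assumptions made for illustration.

```python
from typing import Dict, Set, Tuple

Service = Tuple[str, int]  # (protocol, port)


def select_static(usage: Dict[Service, Tuple[int, int]],
                  total_instances: int,
                  total_users: int,
                  inst_threshold: float = 0.10,
                  user_threshold: float = 0.10) -> Set[Service]:
    """Keep services used by at least a fraction I of the application's
    instances and at least a fraction U of its users.

    `usage` maps a service to (instances using it, users using it).
    """
    selected = set()
    for service, (n_instances, n_users) in usage.items():
        instance_ratio = n_instances / total_instances
        user_ratio = n_users / total_users
        if instance_ratio >= inst_threshold and user_ratio >= user_threshold:
            selected.add(service)
    return selected


# Hypothetical counts for one application observed on 52 laptops.
usage = {
    ("tcp", 5061): (400, 40),  # frequent and widespread
    ("tcp", 1343): (150, 3),   # frequent, but seen at only 3 users
    ("tcp", 8057): (20, 18),   # widespread, but rarely used
}
strict = select_static(usage, total_instances=1000, total_users=52)
loose = select_static(usage, 1000, 52, inst_threshold=0.05, user_threshold=0.05)
```

With I = U = 10% only (tcp, 5061) is kept; lowering both thresholds to 5% also admits (tcp, 1343), which mirrors the trade-off between true and false positives discussed for Table 3.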

Fig. 5(d) evaluates how sensitive the algorithm that classifies applications as using transient connections or not is to its parameter T (§3.2). It shows how many of the 12 applications used in Table 2 were misclassified; a positive value means misclassified as having transient connections, while a negative value means misclassified as not having transient connections (there are no cases in which applications are misclassified both ways). For T in the range 0.8-0.9, the classification is flawless. A value of T that is too high can lead to misclassifying applications as not having transient connections, which results in false positive dependencies, while a value that is too low leaves the dependency detection up to the second part of the algorithm, where infrequently used static dependencies may turn into false negatives.

Table 3 shows the effects of the connection table sampling rate together with the effects of Macroscope’s thresholds on static dependency extraction. The table presents the total number of true positives and the total number of false positives detected by Macroscope relative to the total number of actual static dependencies (ground truth); the results are computed by aggregating the dependencies of all the applications in Table 1. We can see that Macroscope’s accuracy gradually degrades as the sampling frequency decreases, and that using a lower threshold to select static dependencies can increase the number of true positives at the expense of an increased number of false positives.

4.3 Dependency Characteristics

Characterizing static dependencies: The distribution of the relations between applications and services is very skewed. A few applications depend on many services, while most applications depend on few services. Similarly, a few services are used by many applications, while most services are used by a single application. This type of analysis is unavailable to approaches that use only packet traces, and we are not aware of previous work profiling the application-service relation for all the applications used in a large enterprise.

| Application | Description | # of static dependencies |
|---|---|---|
| outlook.exe | MS Outlook | 18 |
| iexplore.exe | MS Internet Explorer | 11 |
| x1service.exe | Enterprise search engine | 10 |
| svchost.exe | MS Windows component dynamically loading DLLs | 9 |
| winproj.exe | Component of MS Project | 9 |
| communicator.exe | MS Office Communicator | 8 |
| firefox.exe | Mozilla Firefox | 8 |
| spoolsv.exe | MS Windows service managing spooled print jobs | 7 |
| mmc.exe | MS Windows Management Console | 7 |
| pwconsole.exe | MS Office Live Meeting | 7 |

Table 4: Top 10 applications with most static dep.

Figure 6: Application-Service relation distribution. Panels: (a) number of static services used by each application, (b) number of applications using each service.

Fig. 6(a) shows the distribution of the number of static services used by each of the 159 applications present in the traces, as detected by Macroscope; as we have shown, detection is not perfect but is quite accurate. The average number of static dependencies is 2.5, and the median is 1. There are 73 applications (out of 159) with only one static dependency as detected by Macroscope. Among these, most depend on: the proxy used to communicate with machines outside the enterprise (28 applications), Web (10), SSH (10), HTTPS (5) and LDAP (3).

Table 4 presents the top ten applications with the most static dependencies.

Conversely, Fig. 6(b) shows the number of applications using each of the services that appear as static dependencies. The number of distinct services that applications statically depend on is 92; the average number of applications using one of these static dependencies is 4.6, the maximum is 85, and the median is 1. Of these services, 62 are specific to a single application. Examples of services used by a single application include POP3 (tcp, 995), Telnet (tcp, 23), NTP (tcp, 123), IMAP (tcp, 993), McAfee Update (tcp, 8440, 8442), HTTP alternate (tcp, 8080), Intel UNS (tcp, 16993), etc.
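Both distributions can be derived from a single mapping of applications to their static dependencies. The Python sketch below shows one way to compute the summary statistics quoted above; the function name and the toy input are hypothetical and do not reproduce the enterprise data.

```python
from collections import Counter
from statistics import mean, median
from typing import Dict, Set, Tuple

Service = Tuple[str, int]  # (protocol, port)


def relation_stats(deps: Dict[str, Set[Service]]) -> Dict[str, float]:
    """Summarize services-per-application and applications-per-service."""
    services_per_app = [len(services) for services in deps.values()]
    # For every service, count how many applications depend on it.
    apps_per_service = Counter(s for services in deps.values() for s in services)
    counts = list(apps_per_service.values())
    return {
        "services_per_app_mean": mean(services_per_app),
        "services_per_app_median": median(services_per_app),
        "distinct_services": len(counts),
        "apps_per_service_mean": mean(counts),
        "apps_per_service_median": median(counts),
        "apps_per_service_max": max(counts),
        "single_app_services": sum(1 for c in counts if c == 1),
    }


# Toy input: three applications and four services.
deps = {
    "a.exe": {("tcp", 911)},
    "b.exe": {("tcp", 911), ("tcp", 80)},
    "c.exe": {("tcp", 911), ("tcp", 80), ("tcp", 389), ("tcp", 135)},
}
stats = relation_stats(deps)
```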

Table 5 presents the top ten services associated with static dependencies, ranked by the number of different applications that use them.

Besides characterizing the dependencies of the enterprise applications, these results also offer an extra motivation for our work. The results show that: (1) end-point detection is necessary due to the large number of applications using only a few services; (2) it is difficult to manually create a list of all the services to be verified for faults (in fact, by profiling only 12 applications we have identified many services that are not on the engineering list of well known services)³; and (3) isolating faults for most applications should be fast (due to the small number of static dependencies).

| Service | Service Name | # Apps. use it |
|---|---|---|
| (tcp, 911) | Enterprise Proxy | 85 |
| (tcp, 80) | Web | 64 |
| (tcp, 443) | HTTPS | 36 |
| (tcp, 9090) | Proxy Discovery | 30 |
| (tcp, 135) | EPMAP | 23 |
| (tcp, 389) | LDAP AD | 15 |
| (tcp, 1025) | Active Dir. (AD) | 13 |
| (tcp, 22) | SSH | 13 |
| (tcp, 3180) | MS SQL Svr. | 8 |
| (tcp, 21) | FTP | 5 |

Table 5: Top 10 services used by most applications.

Figure 7: Static dependencies of 12 applications.

Transient connections vs static dependencies: Out of the 159 applications encountered, our algorithm identifies 136 as not having transient connections and 23 as having transient connections. For applications without transient connections, Macroscope achieves its best accuracy in detecting static dependencies, since it produces no false positives and, for large traces, false negatives should be very few. As a surprising side note, several transient services (destinations of application outgoing connections using ephemeral ports) are used by multiple applications; for example, three of these services are used by 6 different applications.

Inside vs Outside Services: Out of the 159 applications, 73 communicate only with internal services, 51 communicate only with external services, and 35 use both internal and external services. This quantifies the level of service an enterprise IT management department can provide its employees when it conducts fault isolation, server replication, and so on (assuming it cannot fix problems on external servers).

Multi-application dependencies: Fig. 7 shows the static dependencies of all the 12 applications used in Table 2; the upper vertices represent the applications, while the lower vertices represent services. There are quite a few common services among these applications; some of these are highlighted in the figure.

Profiled Dependencies: Fig. 8 shows the static dependencies for the McAfee VirusScan Framework Service application; each dependency is labeled with its usage frequency (i.e., the ratio between the number of instances of the application for which Macroscope detected the use of the dependency and the total number of instances of the application). Note that even if the usage frequency of some dependencies is lower than the 10% threshold used to select static dependencies, they are considered static dependencies because this application is classified as not using transient connections. From Fig. 8 one can see that the two update services are used with comparable frequencies and that half of the instances used the web.

³ If this list were complete, Macroscope could classify connections using it as an oracle, and accuracy would be almost perfect (with at most a few, not sampled, false negatives).

Figure 8: FrameworkService - static dependencies weighted by usage frequency (#uses / #instances).

Figure 9: Communicator - dependency usage frequency in time (2 day slices, curvefit). The x-axis is time in 2-day units; the y-axis is usage frequency (0 to 1); one curve is plotted for each of (tcp, 80), (tcp, 443), (tcp, 911), (tcp, 9090), (tcp, 135), (tcp, 389), (tcp, 1025) and (tcp, 5061).

An interesting feature of Macroscope is that it can trace the variation of dependencies over time. Fig. 9 shows the usage frequency for the static dependencies of Office Communicator over multiple 2-day periods; the usage frequency was extracted from each of the 2-day periods independently. As the chart shows, these weights are fairly stable over time, which indicates that the application code base and its usage pattern were constant throughout the traces. This fine-granularity detection is difficult to achieve when using only packet traces, due to the larger number of samples required for an accurate measurement.
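A per-slice usage frequency of this kind can be computed as sketched below in Python. The sketch assumes that each application instance is assigned to the slice in which it started and that instance identifiers are unique; these choices, like the function and variable names, are illustrative assumptions rather than Macroscope's documented behavior.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

Service = Tuple[str, int]  # (protocol, port)
SLICE_SECONDS = 2 * 24 * 3600  # 2-day slices, as in Fig. 9


def usage_frequency_by_slice(instance_start: Dict[str, float],
                             uses: Iterable[Tuple[str, Service]],
                             slice_seconds: int = SLICE_SECONDS
                             ) -> Dict[int, Dict[Service, float]]:
    """For each time slice, return the fraction of the application instances
    started in that slice that used each service."""
    instances_in_slice = defaultdict(set)
    for instance, start in instance_start.items():
        instances_in_slice[int(start // slice_seconds)].add(instance)

    instances_using = defaultdict(set)
    for instance, service in uses:
        instances_using[service].add(instance)

    result = {}
    for index, instances in instances_in_slice.items():
        result[index] = {
            service: len(instances & users) / len(instances)
            for service, users in instances_using.items()
        }
    return result


starts = {"i1": 0.0, "i2": 3600.0, "i3": 200000.0}  # seconds since trace start
uses = [("i1", ("tcp", 80)), ("i2", ("tcp", 80)),
        ("i1", ("tcp", 443)), ("i3", ("tcp", 80))]
freq = usage_frequency_by_slice(starts, uses)
```

Because each slice is evaluated independently, a stable profile shows up as nearly constant values across slice indices.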

5. DISCUSSION AND FUTURE WORK

5.1 Macroscope Usage

As we showed previously, Macroscope can generate visualizations for the application and service dependencies at various granularities and scopes. Here, we walk through some concrete use cases where network operators can use the visualizations effectively to carry out a (network) management task. We intend the following discussion not as a particular prescription of how Macroscope should be used (the discussion is high level), but more as a way to illustrate the different ways in which Macroscope can aid network operators in carrying out their tasks more effectively.

Fault Localization: One of the common complaints registered with IT helpdesks in enterprise networks relates to applications (email, intranet apps, etc.) not functioning. In such a case, the debugging process normally involves rebooting as a first step (which almost always leads to a diatribe from the user about IT operators and their limited knowledge). The situation could be very different with Macroscope in place; the operator could generate a graph such as Fig. 3(a) and compare the last snapshot of activity (where the fault occurred) to a snapshot derived from a different time. Missing edges in the visualization graph are easily identified.
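If each snapshot is represented as a set of (application, service) edges, finding the missing edges reduces to a set difference. The Python sketch below illustrates this; the edge representation and the example values are assumptions, not Macroscope's data format.

```python
from typing import Set, Tuple

Service = Tuple[str, int]   # (protocol, port)
Edge = Tuple[str, Service]  # (application, service)


def missing_edges(baseline: Set[Edge], faulty: Set[Edge]) -> Set[Edge]:
    """Edges present in a healthy snapshot but absent when the fault occurred."""
    return baseline - faulty


baseline = {
    ("communicator.exe", ("tcp", 5061)),
    ("communicator.exe", ("tcp", 443)),
    ("communicator.exe", ("tcp", 389)),
}
faulty = {
    ("communicator.exe", ("tcp", 443)),
    ("communicator.exe", ("tcp", 389)),
}
suspects = missing_edges(baseline, faulty)
print(suspects)  # {('communicator.exe', ('tcp', 5061))}
```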

While this does not describe the entire troubleshooting process, a tool such as Macroscope allows the operators to reason about the problem aided by the visualization, rather than recite from a canned playbook (“step 1: reboot computer and try again”). Macroscope can also generate application specific visualizations across a group of malfunctioning applications, as shown in Fig. 7, or across a group of users, as shown in Fig. 3(c). If the IT operators receive a number of trouble tickets, such a visualization can tell them what is working and what else needs to be looked at. Given this information, the operators can easily generate a plausible set of servers/services to check. The visualizations generated by Macroscope can also improve efficiency; nodes in the graph can be ordered by degree or weight, and operators can search for faults “greedily”.

Load Balancing, Replication and Capacity Planning: In most modern enterprise networks, important infrastructure services are replicated at each geographic site (to reduce latency and cut down bandwidth costs). However, the utilization across these servers varies dramatically. A routine network maintenance task is that of placing additional resources (servers, storage, etc.) to improve performance. Macroscope can answer queries that show how users are “pinned” to servers across locations and that identify the locations that can benefit the most from an upgrade. By the same token, the visualizations can also inform network operators how many applications and users will be impacted by a single service outage.

Consider Figures 3(c) and 7. Clearly, not all the services are equally significant. A network operator who wants to upgrade gets the best “bang for the buck” by upgrading hardware (or managing the utilization in some other way) for the services/servers that have many incident edges.
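Ranking services by their number of incident edges is a simple degree count over the dependency graph. A minimal Python sketch, with hypothetical names and example edges, could look as follows.

```python
from collections import Counter
from typing import Iterable, List, Tuple

Service = Tuple[str, int]   # (protocol, port)
Edge = Tuple[str, Service]  # (application or application instance, service)


def rank_services(edges: Iterable[Edge]) -> List[Tuple[Service, int]]:
    """Order services by the number of distinct incident edges, highest first."""
    degree = Counter(service for _source, service in set(edges))
    # Sort by descending degree; ties are broken by the service tuple.
    return sorted(degree.items(), key=lambda item: (-item[1], item[0]))


edges = [
    ("outlook.exe", ("tcp", 911)),
    ("iexplore.exe", ("tcp", 911)),
    ("outlook.exe", ("tcp", 80)),
]
ranking = rank_services(edges)
print(ranking)  # [(('tcp', 911), 2), (('tcp', 80), 1)]
```

The same ranking also indicates how many applications would be affected by an outage of a given service.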

Detect Malicious or Misbehaving Applications: Macroscope can enable operators to identify and pinpoint misbehaving or malicious applications. In many malware instances, a popular application is replaced by a trojaned version whose behavior is different from that of the original application. Macroscope enables the building of application “profiles” that can help detect these changes. At a high level, the static dependencies and their associated properties (usage frequency, traffic weight, destinations) essentially describe the profile of an application. This profile can be built over time (and for individual users). The historical profiles can then be compared to profiles generated from more recent data. Fig. 9 shows the usage frequency of different services over a window of time (one of the elements in the profile). It is very likely that a trojaned application that replaces the original will perturb the usage frequency, which can be detected. Moreover, Macroscope can detect applications doing port scanning by looking at the number of rejected connections.
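One simple way to compare a historical profile with a recent one is to flag services whose usage frequency moved by more than a tolerance. The Python sketch below is only an illustration of that idea: the `tolerance` value, the function name and the example profiles are hypothetical, and the paper does not prescribe a specific comparison rule.

```python
from typing import Dict, Tuple

Service = Tuple[str, int]  # (protocol, port)


def profile_drift(historical: Dict[Service, float],
                  recent: Dict[Service, float],
                  tolerance: float = 0.2) -> Dict[Service, Tuple[float, float]]:
    """Return services whose usage frequency changed by more than `tolerance`.

    A service missing from one profile is treated as having frequency 0,
    so newly appearing and disappearing dependencies are flagged as well.
    """
    flagged = {}
    for service in set(historical) | set(recent):
        old = historical.get(service, 0.0)
        new = recent.get(service, 0.0)
        if abs(new - old) > tolerance:
            flagged[service] = (old, new)
    return flagged


historical = {("tcp", 80): 0.90, ("tcp", 443): 0.50}
recent = {("tcp", 80): 0.85, ("tcp", 443): 0.10, ("tcp", 6667): 0.70}
changes = profile_drift(historical, recent)
```

Here (tcp, 443) and the previously unseen (tcp, 6667) are flagged, while the small change on (tcp, 80) is ignored.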

5.2 Future Work

Causal order of dependencies: Macroscope can evaluate the causality relation between dependencies. We say two dependencies A and B are causally ordered (and write A->B) if A usually finishes just before B is started (and the converse is not true). In this context, “before” means within a time window. These constructions allow network operators to understand how services are composed, to set an order among the services to be verified for faults, and to improve overall application responsiveness (the latency of the application is related to the latency of the longest path in such a graph). We have implemented this algorithm, but due to the unavailability of ground truth for the causal order (to validate the results), we do not include any such results in this paper.
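One possible reading of this definition is sketched below. It is not the implementation referred to above: the session representation, the `window` length and the `threshold` that stands in for “usually” are all assumptions made for illustration.

```python
def precedes(a_intervals, b_intervals, window):
    """True if some use of A ends at most `window` seconds before a use of B starts."""
    return any(0 <= b_start - a_end <= window
               for _, a_end in a_intervals
               for b_start, _ in b_intervals)

def causal_order(sessions, a, b, window=1.0, threshold=0.8):
    """Return (first, second) if the two dependencies are causally ordered, else None.

    `sessions` is a list of dicts mapping a dependency name to a list of
    (start, end) timestamps; this layout is hypothetical.
    """
    both = [s for s in sessions if a in s and b in s]
    if not both:
        return None
    a_first = sum(precedes(s[a], s[b], window) for s in both) / len(both)
    b_first = sum(precedes(s[b], s[a], window) for s in both) / len(both)
    # Ordered only if one direction is frequent and the converse is not.
    if a_first >= threshold and b_first < threshold:
        return (a, b)
    if b_first >= threshold and a_first < threshold:
        return (b, a)
    return None

# Illustrative sessions (timestamps in seconds, made up for this sketch).
sessions = [
    {"dns": [(0.0, 0.1)], "http": [(0.2, 1.5)]},
    {"dns": [(10.0, 10.05)], "http": [(10.1, 12.0)]},
    {"dns": [(20.0, 20.1)], "ldap": [(20.3, 20.9)]},
]
order = causal_order(sessions, "dns", "http")
```

Here `order` is `("dns", "http")`, i.e., dns->http, because in every session containing both dependencies the DNS exchange finishes shortly before the HTTP connection starts and never the other way around.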

Service side dependencies: If server side traces are available in addition to client side traces (used in this paper), our work can be extended to create complex dependency graphs, of depth higher than one; e.g., not only capture that application A depends on service S, but also that S itself (the application implementing it) depends on S1, and that S1 depends on S2, and so on.
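Assuming the client side and server side dependencies have been merged into a single adjacency map, the deeper graph can be explored with a breadth-first traversal, as in the sketch below. The map layout and the function name are hypothetical.

```python
from collections import deque

def transitive_dependencies(direct, root):
    """Return {dependency: depth} for everything reachable from `root`.

    `direct` maps an application or service to the services it depends on
    directly (depth one). The traversal tolerates cycles.
    """
    depth = {}
    seen = {root}
    queue = deque([(root, 0)])
    while queue:
        node, d = queue.popleft()
        for dep in direct.get(node, ()):
            if dep not in seen:
                seen.add(dep)
                depth[dep] = d + 1
                queue.append((dep, d + 1))
    return depth

# The chain from the text: A depends on S, S on S1, and S1 on S2.
direct = {"A": ["S"], "S": ["S1"], "S1": ["S2"]}
deps = transitive_dependencies(direct, "A")
```

For this chain, `deps` maps S to depth 1, S1 to depth 2 and S2 to depth 3.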

Kernel-level tracing: Macroscope’s tracing method can be further improved. The connection table sampling can be replaced with fine-granularity monitoring of system calls in the kernel. In this way, UDP dependencies can be detected without the use of packet level traces; however, packet level traces would still be useful for profiling dependencies (e.g., associating them with traffic volumes) and for identifying unsuccessful connections. Also, by tracing inter-process communications, this method may more easily detect the dependencies accessed indirectly through the use of an OS service such as lsass. The disadvantage of such a method may be the tracing overhead, which is proportional to the network traffic (i.e., the overhead is higher when the computer is busier), unlike the almost constant overhead of the connection table sampling.

Identifying applications: In the future, the Tracer could identify applications by using a hash of the executable file instead of the executable name, to avoid name collisions. However, this may substantially reduce the samples available for each application; e.g., different compilers can generate different executable files.
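A minimal sketch of hash-based identification follows. SHA-256 is an assumed choice of hash function, and the temporary files merely stand in for real executables.

```python
import hashlib
import os
import tempfile

def executable_id(path, chunk_size=65536):
    """Identify an executable by the SHA-256 digest of its contents."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large binaries are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

# Stand-in files: two names with identical contents and one different build.
files = [("app.exe", b"build-1"), ("renamed.exe", b"build-1"), ("app_v2.exe", b"build-2")]
with tempfile.TemporaryDirectory() as tmp:
    ids = {}
    for name, content in files:
        path = os.path.join(tmp, name)
        with open(path, "wb") as f:
            f.write(content)
        ids[name] = executable_id(path)
```

The renamed copy receives the same identifier as the original, while the second build receives a different one. The latter illustrates the drawback noted above: samples from different builds of the same application are no longer pooled.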

6. RELATED WORK

In recent years, extracting networked application dependencies has been recognized as a critical task in the larger problem of managing enterprise networks and data centers. Previous efforts to address this problem, notably Sherlock [5], eXpose [12], and Orion [9], are based on network-level tracing. That is, they operate solely on network traffic traces to extract the dependencies. Sherlock uses time correlation of packets between different services and computes conditional probabilities to decide dependencies [5]. eXpose partitions the packet trace into time windows and looks for service dependencies that occur within the same time window by computing a modified JMeasure from an activity matrix [12]. Orion computes spikes in the delay distribution of flow pairs and uses thresholds to decide if two services depend on each other [9]. Constellation [6] uses machine learning algorithms to extract dependencies from the timings of packets sent and received. These methods require no modification or instrumentation of individual end-hosts and hence can be deployed quite easily. However, as we show in this paper, they require manual steps and can have high rates of false negatives and false positives. In Macroscope, we take a different approach, choosing to instrument end-hosts to collect application level data. Thus, Macroscope is harder to deploy, but the benefit is that we overcome the challenges faced by methods relying only on network data.


To address the challenge of debugging distributed systems, researchers have studied the tracing of application [10] or middleware [8] execution paths. These systems focus on a particular distributed system and require tracing within a particular framework as well as modification of applications; thus, they are hard to deploy in today’s complex, heterogeneous service environment. Project 5 [4] and WAP5 [17] aim to debug distributed systems by taking a black-box debugging approach. They passively monitor messages and infer causal execution paths from these message traces. Magpie [7] and Pinpoint [8] take a gray-box debugging approach that combines prior knowledge, observations, and inference. Pip [16] detects unexpected behavior in distributed systems by comparing actual behavior with expected behavior. In contrast, Macroscope focuses on automatic dependency extraction for a large number of networked applications and services, and does this without application modification.

Network monitoring and application traffic classification have been studied extensively in the past. BLINC [14] classifies traffic by using behavioral patterns at the social, functional and application levels. Kannan et al. [13] proposed algorithms that semi-automatically extract application session structures from network connection-level traces. The term session denotes a set of connections related to a single application task (e.g., an FTP session).

7. CONCLUSIONS

We have presented Macroscope, a system that applies end-point tracing to detect application dependencies on services. Macroscope is completely automatic, simple enough to be practical, accurate in detecting static dependencies and enables application profiling. In the future, we plan to develop network management algorithms for fault isolation and service replication that incorporate Macroscope’s profiles of the dependency relationship between applications and services.

8. REFERENCES

[1] Bro. http://www.bro-ids.org/.

[2] Taming technology sprawl. http://online.wsj.com/article/SB120156419453723637.html?mod=technology_main_promo_left.

[3] Winpcap. http://www.winpcap.org/.

[4] M. K. Aguilera, J. C. Mogul, J. L. Wiener, P. Reynolds, and A. Muthitacharoen. Performance debugging for distributed systems of black boxes. In SOSP, 2003.

[5] P. Bahl, R. Chandra, A. Greenberg, S. Kandula, D. A. Maltz, and M. Zhang. Towards highly reliable enterprise network services via inference of multi-level dependencies. In ACM SIGCOMM, 2007.

[6] P. Barham, R. Black, M. Goldszmidt, R. Isaacs, J. MacCormick, R. Mortier, and A. Simma. Constellation: automated discovery of service and host dependencies in networked systems. MSR-TR-2008-67, 2008.

[7] P. Barham, A. Donnelly, R. Isaacs, and R. Mortier. Using Magpie for request extraction and workload modelling. In OSDI, 2004.

[8] M. Y. Chen, A. Accardi, E. Kiciman, D. Patterson, A. Fox, and E. Brewer. Path-based failure and evolution management. In NSDI, 2004.

[9] X. Chen, M. Zhang, Z. M. Mao, and V. Bahl. Automating network application dependency discovery: Experiences, limitations, and new solutions. In OSDI, 2008.

[10] R. Fonseca, G. Porter, R. H. Katz, S. Shenker, and I. Stoica. X-Trace: A pervasive network tracing framework. In NSDI, 2007.

[11] F. Giroire, J. Chandrashekar, N. Taft, E. Schooler, and K. Papagiannaki. Exploiting temporal persistence to detect covert botnet channels. In RAID’09.

[12] S. Kandula, R. Chandra, and D. Katabi. What’s going on? Learning communication rules in edge networks. In ACM SIGCOMM, 2008.

[13] J. Kannan, J. Jung, V. Paxson, and C. E. Koksal. Semi-automated discovery of application session structure. In IMC, 2006.

[14] T. Karagiannis, K. Papagiannaki, and M. Faloutsos. BLINC: Multilevel traffic classification in the dark. In ACM SIGCOMM, 2005.

[15] Z. Kerravala. Configuration management delivers business resiliency. In The Yankee Group, 2002.

[16] P. Reynolds, C. Killian, J. L. Wiener, J. C. Mogul, M. A. Shah, and A. Vahdat. Pip: Detecting the unexpected in distributed systems. In NSDI, 2006.

[17] P. Reynolds, J. L. Wiener, J. C. Mogul, M. K. Aguilera, and A. Vahdat. WAP5: Black-box performance debugging for wide-area systems. In WWW, 2006.
