DARPA ULTRALOG Final Report - Industrial and Manufacturing ...

Figure 5. Results Overview (series: Default Policy, Controlled; x-axis: Stress (S))

• Another difference is that the authors of [34] assume operating system support for tuning parameters such as buffer sizes and operating system settings, which may not be available in many MAS-based situations because of mobility, security and real-time constraints. In addition to estimating performance, the queueing model may be able to eliminate instabilities in the queueing sense, a capability that is not apparent in the other approach.

• Most importantly, their work reflects a two-level hierarchy in which the resource manager mediates several application environments to obtain maximum utility for the data center. Our work takes the perspective of a single, self-optimizing application that tries to be survivable by maximizing its own utility.

In spite of these differences, it is interesting to see that the self-controlling capability can be achieved, with or without explicit layering, in real-world applications.

5. Empirical Evaluation on CPE Test-bed

The aforementioned framework was implemented within CPE, which we use as a test-bed for our experimentation. The main goal of this experimentation was to examine whether application-level adaptivity leads to utility gains in the long run. The superior agents in CPE continuously plan maneuvers for their subordinates, and these maneuvers are executed by the lower-rung nodes. We subjected the entire distributed community to random stresses by simulating enemy clusters of varying sizes and arrival rates. These stresses translated into the need to perform the distributed “sense-plan-respond” cycle more frequently, causing increased load and traffic in the network of agents. The stresses were created by a world agent whose main purpose was to simulate warlike dynamics within our test-bed. The CPE prototype consists of 14 agents spread across a physical layer of 6 CPUs.
We used the prototype CPE framework to run 36 experiments at two stress levels (S = 0.25 and S = 0.75). There were three layers of hierarchy, as shown in Figure 2a, with a three-way branching at each level and one supply node. The community’s utility function was based on the achievement of real goals in military engagements, such as terminating or damaging the enemy, and on reducing the penalty incurred by consuming resources such as fuel or by sustaining damage.

To keep our queueing models simple, we assumed that external arrivals were Poisson while service times were generally distributed. Under this assumption, the framework could select a BCMP or M/G/1 queueing model for real-time performance estimation. To cater to general arrival processes, the framework also contains a QNA-based model and a simulation-based model.

The baseline for comparison was the do-nothing (default) policy, in which we let the Cougaar infrastructure manage conditions of high load. Although our framework did better than any set of opmodes at the two stress levels, as shown in Figure 5, Figure 6 shows instantaneous and cumulative utility for two opmodes in particular (Default A and Default B). In the long run the framework enhanced the utility of the application compared with the default policy. At both stress levels, the controlled scenario performed better than the default, as shown in Figure 6.

We did observe oscillations in the instantaneous utility, and we attribute them to imprecise prediction of the stresses. Stresses vary relatively fast, on the order of seconds, while the control granularity was on the order of minutes. Because a military engagement follows no regular stress pattern, the higher-stress case is hard to cope with.
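The M/G/1 estimate mentioned in the experimental setup can be illustrated with the standard Pollaczek-Khinchine mean-value formula. The following is a minimal sketch, not the framework's implementation: the function name, the `MG1Estimate` type and the parameter names are assumptions made for illustration. It takes a Poisson arrival rate together with the mean and squared coefficient of variation of a generally distributed service time, and it rejects unstable inputs (utilization of 1 or more), which is one simple way a queueing model can flag instability.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MG1Estimate:
    """Mean-value results for a single M/G/1 queue (illustrative type)."""
    utilization: float    # rho = arrival_rate * mean_service
    mean_wait: float      # mean time spent waiting in queue, excluding service
    mean_response: float  # mean wait plus mean service time


def mg1_estimate(arrival_rate: float, mean_service: float,
                 service_scv: float) -> MG1Estimate:
    """Pollaczek-Khinchine estimate for an M/G/1 queue.

    arrival_rate: Poisson arrival rate (tasks per second)
    mean_service: mean service time E[S] (seconds)
    service_scv:  squared coefficient of variation, Var[S] / E[S]^2
    """
    if arrival_rate < 0 or mean_service <= 0 or service_scv < 0:
        raise ValueError("rates, service times and variability must be non-negative")
    rho = arrival_rate * mean_service
    if rho >= 1.0:
        # The queue grows without bound; no steady-state estimate exists.
        raise ValueError("unstable queue: utilization must be below 1")
    # W_q = lambda * E[S^2] / (2 * (1 - rho)), with E[S^2] = E[S]^2 * (1 + scv)
    mean_wait = rho * mean_service * (1.0 + service_scv) / (2.0 * (1.0 - rho))
    return MG1Estimate(rho, mean_wait, mean_wait + mean_service)
```

With `service_scv=1.0` the result reduces to the M/M/1 case and with `service_scv=0.0` to deterministic service, so the same function covers a range of service-time variability.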
In contrast to MAS applications dealing with data centers, where load can be attributed to time-of-day and other seasonal effects, it is not possible to get accurate load predictions for MAS applications simulating wartime loads. We think that this could be the reason why our utility falls in the higher-stress case. In subsequent work, we intend to enhance Cougaar so that it supports the application layer by guaranteeing some end-to-end delay requirements.

Figure 6. Sample Results: (a) Instantaneous Utility (stress 0.25); (b) Cumulative Utility (stress 0.25); (c) Instantaneous Utility (stress 0.75); (d) Cumulative Utility (stress 0.75). Each panel plots Controlled, Default A and Default B against time (sec.).

6. Conclusions and Future Work

In this paper, we were able to control a real-time DMAS so that it achieved better overall utility in the long run, thus making the application survivable. Utility improvements were made through application-level trade-offs between quality of service and performance. We used a queueing-network-based framework for performance analysis and subsequently used a learned utility model to compute the overall benefit to the DMAS (i.e. the community). While Tesauro et al. [34] employ a resource arbiter to maximize the combined utility of several application environments in a data center scenario, we focus on using queueing theory to maximize the utility from the performance of a single distributed application, given that it has been allocated some resources. We think that the approaches are complementary, with this study providing empirical evidence to support the observation by Jennings and Wooldridge in [18] that agents can be used to optimize distributed application environments, including themselves, through flexible high-level (i.e. application-level) interactions.

Furthermore, this work has resulted in a general architectural lesson. We believe that any distributed application has flows of traffic and requires service-level attributes, such as response times, utilization or delays of components, to be optimized. The paradigm that we have chosen can capture such quantities and help evaluate choices that may lead to better application utility. This concept of breaking the application into flows and allowing a real-time model-based predictor to steer the system into regions of higher utility is quite generic. While we continue the empirical evaluation, we keep the building blocks small to ensure scalability and to reduce interactions. We use TechSpecs to distribute knowledge and meta-data, thus re-emphasizing the separation principle. Subsequently, we hope to broaden the layered control approach to encompass infrastructure-level control within the framework.
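The idea of letting a model-based predictor steer the application toward higher utility can be sketched as a small selection loop over operating modes. Everything in the sketch below is a hypothetical illustration rather than the framework described here: the `OpMode` fields, the linear utility (a quality score minus a delay penalty) and the use of the M/M/1 mean response time as the predictor are all assumptions chosen for brevity, whereas the paper uses a learned utility model and richer queueing models.

```python
import math
from typing import Mapping, NamedTuple


class OpMode(NamedTuple):
    """Hypothetical operating mode of a planning agent."""
    quality: float       # assumed quality-of-service score of plans made in this mode
    mean_service: float  # assumed mean planning time per task (seconds)


def predicted_utility(mode: OpMode, arrival_rate: float,
                      delay_penalty: float) -> float:
    """Assumed utility: quality minus a penalty proportional to response time."""
    rho = arrival_rate * mode.mean_service
    if rho >= 1.0:
        return -math.inf  # unstable queue: never preferred over a stable mode
    # M/M/1 mean response time, used here only as a simple stand-in predictor.
    response = mode.mean_service / (1.0 - rho)
    return mode.quality - delay_penalty * response


def select_opmode(modes: Mapping[str, OpMode], arrival_rate: float,
                  delay_penalty: float) -> str:
    """Return the name of the mode with the highest predicted utility."""
    return max(modes,
               key=lambda name: predicted_utility(modes[name], arrival_rate,
                                                  delay_penalty))


# Example: a slow, high-quality mode against a fast, lower-quality mode.
modes = {"detailed": OpMode(quality=10.0, mean_service=1.0),
         "coarse": OpMode(quality=6.0, mean_service=0.2)}
light_load_choice = select_opmode(modes, arrival_rate=0.25, delay_penalty=1.0)
heavy_load_choice = select_opmode(modes, arrival_rate=0.9, delay_penalty=1.0)
```

Under these assumed numbers the detailed mode is selected at the light load and the coarse mode at the heavy load, which mirrors the trade-off between quality of service and performance discussed above. If every mode is unstable at the given arrival rate, the function still returns a name, so a real controller would need an explicit overload path.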
Another avenue for improvement is to design self-protecting mechanisms so that the security aspect of the framework is reinforced.

Acknowledgements

This work was performed under the DARPA UltraLog Grant#: MDA 972-01-1-0038. The authors wish to acknowledge DARPA for their generous support.

References

[1] Cougaar open source site. http://www.cougaar.org. DARPA.
[2] Ultralog program site. http://dtsn.darpa.mil/ixo/. DARPA.
[3] A. G. Barto, S. J. Bradtke, and S. Singh. Learning to act using real-time dynamic programming. Artificial Intelligence, 72:81–138, 1995.
[4] M. N. Bennani and D. A. Menasce. Assessing the robustness of self-managing computer systems under highly variable workloads. International Conference on Autonomic Computing, 2004.
[5] G. Bolch, S. Greiner, H. de Meer, and K. S. Trivedi. Queueing Networks and Markov Chains: Modeling and Performance Evaluation with Computer Science Applications. John Wiley and Sons, Inc., 1998.
[6] J. Bredin, D. Kotz, and D. Rus. Market-based resource control for mobile agents. Autonomous Agents, 1998.
[7] M. Brinn and M. Greaves. Leveraging agent properties to assure survivability of distributed multi-agent sys-