Revision as of 20:33, 19 August 2008

When you want to evaluate your model over many input scenarios, Analytica's array abstraction mechanism is very convenient. You simple add indexes to your inputs and array abstraction does the rest automatically, propagating those indexes to all downstream results. However, at some point you may find yourself limited by the amount of usable memory, since all those intermediate results, for all scenarios, are cached and consuming memory.

To get around this, you may consider looping over your model (or a subset of your model), and collecting the results for all scenarios only for a selected output variable. By doing so, the intermediate steps in your model consume only the memory required for a single scenario. This technique can sometimes enable a large scale analysis that would otherwise not be possible.

This technique of looping over your model is only possible when your model does not operate over the index you are looping over. If any variable in your model operates over the index, such as the way Sum(...,I) operates over index I, or SDeviation(X)' operates over the Run index, then looping is not a viable option.

The basic technique is demonstrated by the following example. Suppose your model has an input X, indexed only by I, and an output Y, and that the intermediate steps to no operate over the index I. Then we can compute Y by looping as follows:

Variable Y_by_looping := 
   For xi[ ] := X do WhatIf(Y,X,xi)

The same technique can be used for large-scale Monte Carlo sampling. Suppose we have a single uncertain scalar input, U, and a result Z. We can compute the result for Z one-sample at a time using:

Variable Z_by_looping :=
   For ui[ ] := U do WhatIf(Z,U,ui)

With this technique you need to be careful to ensure that your model does not use any statistical functions, since these operate over the Run index. You must also be careful not to switch evaluation mode. In other words, Sample(Z) must not depend on Mid(U), and Mid(Z) must not depend on Sample(Z).

Extensions

Numerous extensions to the above simple technique are often required. These are described here.

Other input dimensions

Your inputs may have indexes that your model does operate over, so you cannot loop over those indexes. For example, an input X may have four dimensions: Time, District, AssumptionSet and Scenario. Your model operates over the Time and District indexes, so you can't loop over these, but you do wish to loop over the AssumptionSet and Scenario indexes. The above approach is then altered by specifying the indexes that are to be left in-tact (i.e., Time) in the inputs as follows:

Variable Y_by_looping := 
   For xi[Time,District] := X do WhatIf(Y,X,xi)

Alternatively, instead of being explicit about which indexes are not looped over, you can be explicit about which indexes are looped over:

Variable Y_by_looping := 
   For s := Scenario do (
      [[For a := AssumptionSet do (
         WhatIf( Y, X, X[Scenario=s,AssumptionSet=a] )
      )
   )

The first technique of listing variables that are not looped over is in many ways the most flexible, since the technique continue to keep memory usage low of other parametric dimensions are introduced by the user.

Multiple inputs

When you have more than one input that uses the index being looped over, the technique is extended by using nested WhatIf calls. To coordinate the "slicing" among the inputs, the easiest and most flexible technique is to introduce a User-Defined Function, and to declare the dimensions of each input that are NOT looped over in the parameter list:

 Function ComputeY( v1,v2 : Array[Time,AssumptionSet] ) :=  WhatIf( WhatIf( Y, X1, v1 ), X2, v2 )
  
 Variable Y_by_looping := ComputeY(X1,X2)

Multiple Outputs

The key to keeping memory consumption low is to only collect the full results for a handful of output variables. If you have more than one output of interest, you want to coordinate the computation so that all are computed simultaneously with each pass. If you have a probability distribution in your model that depend on the selected inputs, and more than one output depends on that probability distribution, then you need to compute both outputs simultaneously so that they are based on the same sample. Also, by computing all outputs simultaneously, you avoid having to re-compute intermediate results each time.

To compute multiple results, you should bundle these into a record structure using References.

Index LoopOutputs := ['Y1','Y2','Y3']
 
Function ComputeOutputs( v1,v2 : Array[Time,AssumptionSet] ) :=
   WhatIf(WhatIf( Array(LoopOutputs,[\Y1,\Y2,\Y3]), X1,v1), X2,v2)

Variable Y2_by_looping := ComputedBy(Y1_by_looping)
 
Variable Y3_by_looping := ComputedBy(Y1_by_looping)
 
Variable Y1_by_looping := 
   Var y := ComputeOutputs(x1,x2);
   Y2_by_looping := #y[LoopOutputs='Y2'];
   Y3_by_looping := #y[LoopOutputs='Y3'];
   #y[LoopOutputs='Y1']

@@ Line 43: / Line 43: @@
 When you have more than one input that uses the index being looped over, the technique is extended by using nested [[WhatIf]] calls.  To coordinate the "slicing" among the inputs, the easiest and most flexible technique is to introduce a [[User-Defined Function]], and to declare the dimensions of each input that are NOT looped over in the parameter list:
-   Function ComputeY( v1,v2 : [[Function Parameter Qualifiers#Array|Array]][Time,AssumptionSet] ) :=  [[WhatIf]]( WhatIf( Y, X1, v1 ), X2, v2 )
+   Function ComputeY( v1,v2 : [[Function Parameter Qualifiers#Array|Array]][Time,AssumptionSet] ) :=  [[WhatIf]]( [[WhatIf]]( Y, X1, v1 ), X2, v2 )
+  &nbsp;
+  Variable Y_by_looping := ComputeY(X1,X2)
+== Multiple Outputs ==
+The key to keeping memory consumption low is to only collect the full results for a handful of output variables.  If you have more than one output of interest, you want to coordinate the computation so that all are computed simultaneously with each pass.  If you have a probability distribution in your model that depend on the selected inputs, and more than one output depends on that probability distribution, then you need to compute both outputs simultaneously so that they are based on the same sample.  Also, by computing all outputs simultaneously, you avoid having to re-compute intermediate results each time.
+To compute multiple results, you should bundle these into a record structure using [[References]].
+ Index LoopOutputs := ['Y1','Y2','Y3']
+ &nbsp;
+ Function ComputeOutputs( v1,v2 : Array[Time,AssumptionSet] ) :=
+    [[WhatIf]]([[WhatIf]]( Array(LoopOutputs,[\Y1,\Y2,\Y3]), X1,v1), X2,v2)
+ Variable Y2_by_looping := [[ComputedBy]](Y1_by_looping)
+ &nbsp;
+ Variable Y3_by_looping := [[ComputedBy]](Y1_by_looping)
+ &nbsp;
+ Variable Y1_by_looping :=
+    [[Var]] y := ComputeOutputs(x1,x2);
+    Y2_by_looping := #y[LoopOutputs='Y2'];
+    Y3_by_looping := #y[LoopOutputs='Y3'];
+    #y[LoopOutputs='Y1']

Difference between revisions of "Looping over a model"

Revision as of 20:33, 19 August 2008

Contents

Extensions

Other input dimensions

Multiple inputs

Multiple Outputs