11.08.2013 Views

Information Sharing in a Multi-Echelon Inventory System

Information Sharing in a Multi-Echelon Inventory System

Information Sharing in a Multi-Echelon Inventory System

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

ZHAO Xiaobo (赵晓波) et al:<strong>Information</strong> <strong>Shar<strong>in</strong>g</strong> <strong>in</strong> a <strong>Multi</strong>-<strong>Echelon</strong>… 469<br />

{ ( ′ ′ ) ( ) ( ) }<br />

P x0, x1 | x0, x1 , a1 x0, x1<br />

=<br />

ϕ0<br />

+<br />

⎧ 1 { x′ 0 = ( x0 + a0 ( x0) − a1( x0, x1)) } ⋅ P{ D = x1 + m1 − x′ 1} ,<br />

⎪<br />

ϕ0<br />

+<br />

⎨1<br />

{ x′ 0 = ( x0 + a0 ( x0) − a1( x0, x1)) } ⋅ P{ D≥x1 + m1} ,<br />

⎪<br />

⎪⎩<br />

0, if 0 < x′ 1≤x1 + m1;<br />

if x′<br />

1 = 0;<br />

if x′ 1 > x1 + m1<br />

(14)<br />

m = m<strong>in</strong> a ( x , x ) , x<br />

ϕ0<br />

+ a ( x ) . The transi- retailer’s policy ϕ 1 = { a1( x0, x1) ∈ A1( x0, x1) , ( x0, x1)<br />

∈<br />

where 1 { 1 0 1 0 0 0 }<br />

tion probability of the retailer only depends on the current<br />

state and action.<br />

From the def<strong>in</strong>ition of the DTMDP, with a given policy<br />

ϕ 0 of the supplier, { ( x0, x1) , a1( x0, x1)<br />

} is a DTMDP.<br />

The one-step transition cost can be calculated as discussed<br />

<strong>in</strong> Section 2. The DTMDP solution gives the<br />

P ( x′ , x′ ) | ( x , x ) , a ( x , x ) =<br />

{ }<br />

0 1 0 1 0 0 1<br />

{ ϕ1<br />

+ } { }<br />

{ ϕ1<br />

+ } { ≥ }<br />

1 } S . S<strong>in</strong>ce full <strong>in</strong>formation shar<strong>in</strong>g is available, this<br />

policy ϕ 1 is also known to the supplier. The one-step<br />

transition probability of the supplier from state<br />

( x0, x1)<br />

to state ( x′ 0, x′<br />

1)<br />

with action a0( x0, x1)<br />

is<br />

given by<br />

⎧ 1 x′ 0 = ( x0 + a0( x0) − a1 ( x0, x1)) ⋅ P D = x1 + m0 − x′ 1 , if 0 < x′ 1≤x1 + m0;<br />

⎪<br />

⎨1<br />

x′ 0 = ( x0 + a0( x0) − a1 ( x0, x1)) ⋅ P D x1 + m0 , if x′<br />

1 = 0;<br />

⎪<br />

⎪⎩<br />

0, if x′ 1 > x1 + m0<br />

With a given policy ϕ 1 of the retailer,<br />

{ ( x x ) a ( x x ) }<br />

0, 1 , 0 0, 1 is also a DTMDP, with the solution<br />

giv<strong>in</strong>g a supplier’s policy ϕ′ 0 . If { ϕ0, ϕ1}<br />

is the<br />

equilibrium system policy, then<br />

ϕ ϕ′ = (16)<br />

0<br />

An iterative algorithm can be developed to search<br />

for the equilibrium policy { ϕ0, ϕ1}<br />

. The algorithm is<br />

as follows.<br />

Algorithm 1<br />

Step 1: Set an <strong>in</strong>itial supplier’s policy ϕ 0 .<br />

Step 2: Solve the DTMDP for the retailer given ϕ 0<br />

to get the retailer’s policy ϕ 1 .<br />

Step 3: Solve the DTMDP for the supplier given ϕ 1<br />

to get the supplier’s policy ϕ′ 0 .<br />

Step 4: If ϕ0 = ϕ′ 0 , stop; otherwise let ϕ′ 0 be the<br />

new supplier’s policy and go to Step 2.<br />

3.2 Non-<strong>in</strong>formation shar<strong>in</strong>g<br />

This model has no <strong>in</strong>formation shar<strong>in</strong>g <strong>in</strong> the system<br />

due to technical reasons or bus<strong>in</strong>ess confidentiality.<br />

The supplier cannot know the states and demands of<br />

the retailer, and the retailer cannot know the states and<br />

orders of the supplier. Therefore, both the supplier and<br />

0<br />

(15)<br />

the retailer can only observe their own <strong>in</strong>ventory states.<br />

The state spaces and the action spaces are<br />

S = 012 , , , , U<br />

(17)<br />

{ }<br />

0 0<br />

{ }<br />

A ( x ) = 01 , , 2,<br />

, U −x<br />

(18)<br />

0 0 0 0<br />

{ 012 }<br />

S = ,, , , U<br />

(19)<br />

1 1<br />

{ }<br />

A1( x1) = 0,1,2, , U1 −x1<br />

(20)<br />

If the retailer orders a1( x 1)<br />

from the supplier, he<br />

may receive less than a1( x 1)<br />

from the supplier. An<br />

order-delivery rate matrix, M, is used to describe the<br />

relationship between the order and the delivery to the<br />

retailer,<br />

⎡ m(0,<br />

0)<br />

⎤<br />

⎢<br />

m(1,0) m(1,1)<br />

⎥<br />

M = ⎢ ⎥<br />

⎢ ⎥<br />

⎢ ⎥<br />

⎣mU ( 1, 0) mU ( 1,1) mU ( 1, U1)<br />

⎦ (21)<br />

where 0 ≤mab ( , ) ≤ 1 is the probability that the delivery<br />

to the retailer is b if he orders a , and<br />

a<br />

( , ) 1<br />

b 0<br />

mab ∑ = , i.e., the summation of any row <strong>in</strong> the<br />

=<br />

matrix is equal to 1.<br />

With matrix M, the one-step transition probability<br />

for the retailer from state x 1 to state x′ 1 with action<br />

a ( x ) ∈ A( x ) is given by<br />

1 1 1 1

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!