Effective planning horizon narrow class

Theorem of effective horizon equivalence

Let $J_{R} : Π \to R$ be a return function, $π^{*} = \arg max_{π} J_{R} (π)$ , the sequence ${π^{k}}_{k \in N}$ is fixed, and $J_{R} (π^{*}) \neq J_{R} (π^{0})$ . We define:

H_{eff} (R, ε) = min {k : \frac{J_{R} (π^{*}) - J_{R} (π^{k})}{J_{R} (π^{*}) - J_{R} (π^{0})} \leq ε}

Then we prove that:

(\forall ε > 0 : H_{eff} (R^{'}, ε) = H_{eff} (R, ε)) ⟺ \exists a \neq 0, b \in R : J_{R^{'}} (π) = a J_{R} (π) + b \forall π \in {π^{*}, π^{0}} \cup {π^{k}}_{k \in N}

Lemma

Let $J_{R} (π^{*}) - J_{R} (π^{0}) \neq 0$ . Then:

(\forall ε > 0 : H_{eff} (R^{'}, ε) = H_{eff} (R, ε)) ⟺ \forall k \in N : f (k, R^{'}) = f (k, R)

where $f (k, R) := \frac{J_{R} (π^{*}) - J_{R} (π^{k})}{J_{R} (π^{*}) - J_{R} (π^{0})}$ .

Proof

$(⟸)$ : Suppose $\forall k : f (k, R^{'}) = f (k, R)$ . Then for every $ε$ , ${k : f (k, R^{'}) \leq ε} = {k : f (k, R) \leq ε}$ , hence $\forall ε > 0 : H_{eff} (R^{'}, ε) = H_{eff} (R, ε)$ .

$(⟹)$ : Assume there exists the smallest $k_{0}$ such that $f (k_{0}, R^{'}) \neq f (k_{0}, R)$ . Without loss of generality, let $f (k_{0}, R^{'}) < f (k_{0}, R)$ (the case $>$ is symmetric). Choose:

ε_{0} \in (f (k_{0}, R^{'}), f (k_{0}, R))

Then $k_{0}$ is included in the set ${k : f (k, R^{'}) \leq ε_{0}}$ , but not in ${k : f (k, R) \leq ε_{0}}$ . Consequently:

H_{eff} (R^{'}, ε_{0}) \leq k_{0} < H_{eff} (R, ε_{0}) ⟹ at ε = ε_{0} : H_{eff} (R^{'}, ε_{0}) \neq H_{eff} (R, ε_{0})

From necessity and sufficiency, the lemma is proven.

Now, based on the lemma, it suffices to prove:

(\forall k : f (k, R^{'}) = f (k, R)) ⟺ \exists a \neq 0, b : J_{R^{'}} (π) = a J_{R} (π) + b \forall π \in {π^{*}, π^{0}} \cup {π^{k}}_{k \in N}

Let's introduce the notation: $J := J_{R}$ , $J^{'} := J_{R^{'}}$ , $Δ_{k} := J (π^{*}) - J (π^{k})$ , $Δ_{0} := J (π^{*}) - J (π^{0}) \neq 0$ .

$(⟸)$ : Suppose $J^{'} (π) = a J (π) + b$ for all $π \in {π^{*}, π^{0}} \cup {π^{k}}$ , with $a \neq 0$ . Then for any $k$ :

J^{'} (π^{*}) - J^{'} (π^{k}) = (a J (π^{*}) + b) - (a J (π^{k}) + b) = a (J (π^{*}) - J (π^{k})) = a Δ_{k}

J^{'} (π^{*}) - J^{'} (π^{0}) = a (J (π^{*}) - J (π^{0})) = a Δ_{0}

Since $a \neq 0$ , the denominator $a Δ_{0} \neq 0$ , and:

f (k, R^{'}) = \frac{a Δ_{k}}{a Δ_{0}} = \frac{Δ_{k}}{Δ_{0}} = f (k, R)

$(⟹)$ : Suppose $\forall k \in N : f (k, R^{'}) = f (k, R)$ . This means:

\begin{matrix} (1) & \frac{J^{'} (π^{*}) - J^{'} (π^{k})}{J^{'} (π^{*}) - J^{'} (π^{0})} = \frac{J (π^{*}) - J (π^{k})}{J (π^{*}) - J (π^{0})} \end{matrix}

Where $J^{'} (π^{*}) \neq J^{'} (π^{0})$ , since otherwise $f (k, R^{'})$ would be undefined for all $k$ , contradicting the assumption $\forall k : f (k, R^{'}) = f (k, R)$ .

Let's define:

a := \frac{J^{'} (π^{*}) - J^{'} (π^{0})}{J (π^{*}) - J (π^{0})}

b := J^{'} (π^{*}) - a J (π^{*})

From $(1)$ , by multiplying both sides by $a Δ_{0}$ :

J^{'} (π^{*}) - J^{'} (π^{k}) = a (J (π^{*}) - J (π^{k})) ⟹ J^{'} (π^{k}) = J^{'} (π^{*}) - a J (π^{*}) + a J (π^{k}) ⟹ J^{'} (π^{k}) = a J (π^{k}) + b

We also separately check the cases with $π^{*}$ and $π^{0}$ :

$a J (π^{*}) + b = a J (π^{*}) + J^{'} (π^{*}) - a J (π^{*}) = J^{'} (π^{*})$

From the definition of $a$ , we have $J^{'} (π^{0}) = J^{'} (π^{*}) - a Δ_{0} = J^{'} (π^{*}) - a (J (π^{*}) - J (π^{0})) = a J (π^{0}) + b$ .

Thus, from necessity and sufficiency, the relation $J^{'} (π) = a J (π) + b$ is proven for all $π \in {π^{*}, π^{0}} \cup {π^{k}}_{k \in N}$ , where $a \neq 0$ , meaning $J^{'}$ is an affine transformation of $J$ on the specified set.

Since the proof establishes that $\forall ε > 0 : H_{eff} (R^{'}) = H_{eff} (R)$ if and only if $J_{R^{'}}$ is an affine transformation of $J_{R}$ ( $a \neq 0$ ) on $π^{*}, π^{0} \cup {π^{k}}_{k \in N}$ , and STARC-equivalence requires $J_{R^{'}} = a J_{R} + b$ with $a > 0$ on the entirety of $Π$ , it follows that STARC-equivalence is a strictly stronger condition. In particular, $[R]_{STARC} \subseteq [R]_{H_{eff}}$ , because any $a > 0$ is a special case of $a \neq 0$ , and the condition on the entirety of $Π$ implies the condition on a subset.

Counterexample to the Pointwise Implication

We show that replacing $\forall ε > 0$ with $\exists ε > 0$ breaks the theorem: we construct explicit $J, J^{'}$ for which the values $H_{eff} (R, ε_{0})$ and $H_{eff} (R^{'}, ε_{0})$ coincide at one specific $ε_{0}$ , but $J^{'}$ is not an affine transformation of $J$ on ${π^{*}, π^{0}} \cup {π^{k}}_{k \in N}$ .

Let us define

J (π^{*}) = 1, J (π^{k}) = \frac{k}{k + 1} (k \geq 0) .

Then $J (π^{0}) = 0$ , $J (π^{k}) \in [0, 1)$ for all $k$ , and $J (π^{k}) \to 1$ , so $π^{*} = \arg max_{π} J (π)$ and the condition $J (π^{*}) - J (π^{0}) = 1 \neq 0$ holds. A direct computation gives

f (k, R) = \frac{J (π^{*}) - J (π^{k})}{J (π^{*}) - J (π^{0})} = \frac{1 - k / (k + 1)}{1} = \frac{1}{k + 1} .

Take $ε_{0} = 2 / 5$ . The condition $1 / (k + 1) \leq 2 / 5$ holds for $k \geq 3 / 2$ , that is, $H_{eff} (R, ε_{0}) = 2$ .

Now define $J^{'}$ on the same $Π$ :

J^{'} (π^{*}) = 1, J^{'} (π^{0}) = 0, J^{'} (π^{1}) = \frac{2}{5}, J^{'} (π^{2}) = \frac{7}{10}, J^{'} (π^{k}) = 1 - \frac{1}{10 (k - 1)} for k \geq 3.

All values lie in $[0, 1)$ , so $π^{*} = \arg max J^{'}$ and $J^{'} (π^{*}) - J^{'} (π^{0}) = 1 \neq 0$ . We compute the ratios:

f (0, R^{'}) = 1, f (1, R^{'}) = \frac{3}{5}, f (2, R^{'}) = \frac{3}{10}, f (k, R^{'}) = \frac{1}{10 (k - 1)} \leq \frac{1}{20} for k \geq 3.

For $ε = ε_{0} = 2 / 5$ we have $f (0, R^{'}) = 1 > 2 / 5$ , $f (1, R^{'}) = 3 / 5 > 2 / 5$ , $f (2, R^{'}) = 3 / 10 \leq 2 / 5$ . Hence

H_{eff} (R^{'}, ε_{0}) = 2 = H_{eff} (R, ε_{0}) .

We show that there is no affine relation. Suppose $J^{'} (π) = a J (π) + b$ on the entire set ${π^{*}, π^{0}} \cup {π^{k}}$ . From $J (π^{*}) = J^{'} (π^{*}) = 1$ and $J (π^{0}) = J^{'} (π^{0}) = 0$ we get $b = 0$ and $a = 1$ , that is, $J^{'} (π) = J (π)$ . But $J^{'} (π^{1}) = 2 / 5 \neq 1 / 2 = J (π^{1})$ , so we have a contradiction.

Theorem of extension to dense subsets of $ε$

Fix $R, R^{'}$ for which $f (k, R), f (k, R^{'})$ are well-defined. Introduce the notation $ϕ_{R} (ε) := H_{eff} (R, ε) \in N \cup {+ \infty}$ , setting $ϕ_{R} (ε) = + \infty$ if ${k : f (k, R) \leq ε} = \emptyset$ . The function $ϕ_{R}$ is non-increasing on $(0, \infty)$ .

Let $E \subseteq (0, \infty)$ be dense in $(0, \infty)$ . Then

(i) \forall ε \in E : ϕ_{R} (ε) = ϕ_{R^{'}} (ε) ⟺ (ii) \forall ε > 0 : ϕ_{R} (ε) = ϕ_{R^{'}} (ε)

Combined with the original lemma, this yields the equivalence of $(i)$ with the affinity $J_{R^{'}} = a J_{R} + b$ , $a \neq 0$ , on ${π^{*}, π^{0}} \cup {π^{k}}$ .

Proof

The implication $(ii) \Rightarrow (i)$ is trivial.

We show $(i) \Rightarrow (ii)$ . By contradiction suppose there exists $ε^{*} > 0$ with $ϕ_{R} (ε^{*}) \neq ϕ_{R^{'}} (ε^{*})$ . Without loss of generality $ϕ_{R} (ε^{*}) < ϕ_{R^{'}} (ε^{*})$ . Denote $n := ϕ_{R} (ε^{*}) \in N$ .

By the definition of $ϕ_{R} (ε^{*}) = n$ we have $f (n, R) \leq ε^{*}$ . From the inequality $ϕ_{R^{'}} (ε^{*}) > n$ , for all $k \in {0, 1, \dots, n}$ we have $f (k, R^{'}) > ε^{*}$ . Set

m := min_{0 \leq k \leq n} f (k, R^{'}) \in (ε^{*}, + \infty] .

The minimum is over a finite set, and each term is strictly greater than $ε^{*}$ , so the minimum is also strictly greater than $ε^{*}$ . Take any $ε \in (ε^{*}, m)$ . Since $f (k, R^{'}) \geq m > ε$ for all $k \leq n$ , we have $ϕ_{R^{'}} (ε) > n$ .
On the other hand, $f (n, R) \leq ε^{*} < ε$ , so $n \in {k : f (k, R) \leq ε}$ and $ϕ_{R} (ε) \leq n$ . In total:

\forall ε \in (ε^{*}, m) : ϕ_{R} (ε) \leq n < ϕ_{R^{'}} (ε) .

The interval $(ε^{*}, m)$ is non-empty and open in $(0, \infty)$ . By density of $E$ there exists $ε_{0} \in E \cap (ε^{*}, m)$ . Then $ϕ_{R} (ε_{0}) \neq ϕ_{R^{'}} (ε_{0})$ , which contradicts $(i)$ .

Sharpness of the Density Condition

We can also show that the density of $E$ in $(0, \infty)$ cannot be weakened without losing the theorem for all admissible $R, R^{'}$ .

Suppose $E$ is not dense: there exists an open interval $(α, β) \subset (0, \infty)$ with $E \cap (α, β) = \emptyset$ . Without loss of generality $β < 1$ (otherwise shift the interval inside $(0, 1)$ , which preserves the property of not intersecting $E$ for a suitable subinterval; the informative zone of $ϕ$ lies precisely in $(0, 1)$ , since $f = 1$ gives $ϕ \equiv 0$ for $ε \geq 1$ ).

Choose $c_{R} \neq c_{R^{'}}$ inside $(α, β)$ . Define the returns via the values of $f$ :

f (0, R) = f (0, R^{'}) = 1, f (1, R) = c_{R}, f (1, R^{'}) = c_{R^{'}},

f (k, R) = f (k, R^{'}) = α / 2 for k \geq 2.

The corresponding $J, J^{'}$ are obtained as $J (π^{*}) = J^{'} (π^{*}) = 1$ , $J (π^{0}) = J^{'} (π^{0}) = 0$ , $J (π^{k}) = 1 - f (k, R)$ , $J^{'} (π^{k}) = 1 - f (k, R^{'})$ . All values $J (π^{k}), J^{'} (π^{k})$ lie in $[0, 1)$ , so $π^{*}$ remains the argmax for both returns, and the conditions of the lemma are satisfied.

We compute $ϕ_{R}$ :

$ε \geq 1$ : $f (0) = 1 \leq ε$ , $ϕ_{R} (ε) = 0$ .
$ε \in [c_{R}, 1)$ : $f (1, R) = c_{R} \leq ε < 1 = f (0)$ , $ϕ_{R} (ε) = 1$ .
$ε \in [α / 2, c_{R})$ : $f (k, R) = α / 2 \leq ε$ for $k \geq 2$ , $f (1, R) > ε$ , $ϕ_{R} (ε) = 2$ .
$ε < α / 2$ : no $f$ qualifies, $ϕ_{R} (ε) = + \infty$ .

The same for $R^{'}$ with the substitution $c_{R} \to c_{R^{'}}$ . A discrepancy is possible only in the zone between $c_{R}$ and $c_{R^{'}}$ : for $ε \in [min (c_{R}, c_{R^{'}}), max (c_{R}, c_{R^{'}}))$ one of the functions equals $1$ , the other equals $2$ . But this zone lies entirely in $(α, β)$ and hence does not intersect $E$ . Outside $(α, β)$ both functions $ϕ_{R}, ϕ_{R^{'}}$ coincide by construction.

We obtained that $ϕ_{R} = ϕ_{R^{'}}$ on $E$ , but not on all of $(0, \infty)$ as $(i)$ holds while $(ii)$ fails. By the original lemma, the failure of $(ii)$ means that $J^{'}$ is not affinely related to $J$ on ${π^{*}, π^{0}} \cup {π^{k}}$ .

As a result, for any $E \subseteq (0, \infty)$ , the property " $(i)$ implies affinity for all admissible $R, R^{'}$ " is equivalent to the density of $E$ in $(0, \infty)$ .

Theorem of effective horizon equivalence

Lemma

Proof

Counterexample to the Pointwise Implication

Theorem of extension to dense subsets of ε

Proof

Sharpness of the Density Condition

Theorem of extension to dense subsets of $ε$