Top-K#

Optimal Cutoff Timestep (OCT)#

g(X)=argmint^{t^1>t^:1(f(X[t^1])=y)}

Top-K Gap for Cutoff Approximation#

The defination of Topk(Y(t)) as the top-k output occurring in one neuron of the output layer,

Ygap=Top1(Y(t))Top2(Y(t)),

which denotes the gap of top-1 and top-2 values of output Y(t). Then, we let D{} denote the inputs in subset of D that satisfy a certain condition. Now, we can define the confidence rate as follows:

Confidence rate: C(t^,D{Ygap>β})=1|D{Ygap>β}|XD{Ygap>β}(g(X)t^),

The algorithm searches for a minimum βR+ at a specific t^, as expressed in the following optimization objective:

argminβC(t^,D{Ygap>β})1ϵ,

where ϵ is a pre-specified constant such that 1ϵ represents an acceptable level of confidence for activating cutoff, and a set of β is extracted under different t^ using training samples.