Evaluation Metrics

For each trajectory segment, participants are required to submit only the predicted longitudinal positions of the following automated vehicle (FAV) over the prediction horizon. The organizing team then derives all other evaluation variables, including speed, acceleration, jerk, spatial headway, spatial gap, and time-to-collision (TTC), from the submitted FAV trajectory and the provided lead-vehicle (LV) trajectory.

The evaluation is based on three aspects: Accuracy, Safety, and Comfort.

The final submission score is defined as the average per-segment total score across all evaluated trajectory segments.

Notation

VariableDescription
iIi \in \mathcal{I}Trajectory-segment index
tTit \in \mathcal{T}_iDiscrete time index for trajectory segment ii
Δt\Delta tSimulation time interval
xitlx_{it}^{l}Provided longitudinal position of the lead vehicle at time tt
x^itf\hat{x}_{it}^{f}Predicted longitudinal position of the FAV at time tt
xitfx_{it}^{f}Reference longitudinal position of the FAV at time tt
vitlv_{it}^{l}Speed of the LV derived from the provided trajectory
v^itf\hat{v}_{it}^{f}Predicted speed of the FAV
vitfv_{it}^{f}Reference speed of the FAV
a^itf\hat{a}_{it}^{f}Predicted acceleration of the FAV
aitfa_{it}^{f}Reference acceleration of the FAV
LlL^lLength of the lead vehicle (LV)
LfL^fLength of the following automated vehicle (FAV)
h^it\hat{h}_{it}Predicted spatial headway
hith_{it}Reference spatial headway
g^it\hat{g}_{it}Predicted spatial gap
gitg_{it}Reference spatial gap
j^itf\hat{j}_{it}^{f}Predicted longitudinal jerk of the FAV
TTCit\mathrm{TTC}_{it}Time-to-collision at time tt
TTCth\mathrm{TTC}_{\mathrm{th}}TTC safety threshold
Rv(i)R_v^{(i)}RMSE of FAV speed for trajectory segment ii
Rh(i)R_h^{(i)}RMSE of spatial headway for trajectory segment ii
Ra(i)R_a^{(i)}RMSE of FAV acceleration for trajectory segment ii
Rj(i)R_j^{(i)}RMS jerk of the predicted FAV trajectory
Rv,thR_{v,\mathrm{th}}Normalization threshold for speed RMSE
Rh,thR_{h,\mathrm{th}}Normalization threshold for headway RMSE
Ra,thR_{a,\mathrm{th}}Normalization threshold for acceleration RMSE
Rj,thR_{j,\mathrm{th}}Normalization threshold for jerk
Sspeed(i)S_{\mathrm{speed}}^{(i)}Normalized speed score
Sheadway(i)S_{\mathrm{headway}}^{(i)}Normalized headway score
Sacc(i)S_{\mathrm{acc}}^{(i)}Normalized acceleration score
Sa(i)S_a^{(i)}Accuracy score
Ss(i)S_s^{(i)}Safety score
Sc(i)S_c^{(i)}Comfort score
Stotal(i)S_{\mathrm{total}}^{(i)}Total score for trajectory segment ii
SfS_fFinal submission score

For evaluation, the following derived quantities are used:

v^itf=x^i,t+1fx^itfΔt,vitf=xi,t+1fxitfΔt,vitl=xi,t+1lxitlΔt \hat{v}_{it}^{f}=\frac{\hat{x}_{i,t+1}^{f}-\hat{x}_{it}^{f}}{\Delta t}, \qquad v_{it}^{f}=\frac{x_{i,t+1}^{f}-x_{it}^{f}}{\Delta t}, \qquad v_{it}^{l}=\frac{x_{i,t+1}^{l}-x_{it}^{l}}{\Delta t} a^itf=v^i,t+1fv^itfΔt,aitf=vi,t+1fvitfΔt \hat{a}_{it}^{f}=\frac{\hat{v}_{i,t+1}^{f}-\hat{v}_{it}^{f}}{\Delta t}, \qquad a_{it}^{f}=\frac{v_{i,t+1}^{f}-v_{it}^{f}}{\Delta t}

The spatial headway and spatial gap are defined as

h^it=xitlx^itf,hit=xitlxitf \hat{h}_{it}=x_{it}^{l}-\hat{x}_{it}^{f}, \qquad h_{it}=x_{it}^{l}-x_{it}^{f} g^it=xitlx^itfLl+Lf2,git=xitlxitfLl+Lf2 \hat{g}_{it}=x_{it}^{l}-\hat{x}_{it}^{f}-\frac{L^l+L^f}{2}, \qquad g_{it}=x_{it}^{l}-x_{it}^{f}-\frac{L^l+L^f}{2}

Accuracy

The accuracy score measures how closely the predicted FAV trajectory matches the reference trajectory in terms of speed, spatial headway, and acceleration.

Rv(i)=1TitTi(v^itfvitf)2 R_v^{(i)}= \sqrt{ \frac{1}{|\mathcal{T}_i|} \sum_{t\in\mathcal{T}_i} \left(\hat{v}_{it}^{f}-v_{it}^{f}\right)^2 } Rh(i)=1TitTi(h^ithit)2 R_h^{(i)}= \sqrt{ \frac{1}{|\mathcal{T}_i|} \sum_{t\in\mathcal{T}_i} \left(\hat{h}_{it}-h_{it}\right)^2 } Ra(i)=1TitTi(a^itfaitf)2 R_a^{(i)}= \sqrt{ \frac{1}{|\mathcal{T}_i|} \sum_{t\in\mathcal{T}_i} \left(\hat{a}_{it}^{f}-a_{it}^{f}\right)^2 }

These RMSE values are normalized into dimensionless scores in [0,1][0,1] using predefined thresholds:

Sspeed(i)=max(0,  1Rv(i)Rv,th) S_{\mathrm{speed}}^{(i)}= \max\left(0,\;1-\frac{R_v^{(i)}}{R_{v,\mathrm{th}}}\right) Sheadway(i)=max(0,  1Rh(i)Rh,th) S_{\mathrm{headway}}^{(i)}= \max\left(0,\;1-\frac{R_h^{(i)}}{R_{h,\mathrm{th}}}\right) Sacc(i)=max(0,  1Ra(i)Ra,th) S_{\mathrm{acc}}^{(i)}= \max\left(0,\;1-\frac{R_a^{(i)}}{R_{a,\mathrm{th}}}\right)

The accuracy score is then computed as

Sa(i)=0.4Sspeed(i)+0.4Sheadway(i)+0.2Sacc(i) S_a^{(i)}= 0.4\,S_{\mathrm{speed}}^{(i)} +0.4\,S_{\mathrm{headway}}^{(i)} +0.2\,S_{\mathrm{acc}}^{(i)}

Safety

The safety score evaluates collision risk at the trajectory-segment level. A collision is identified when the predicted spatial gap becomes negative at any time step and is treated as a hard failure. If no collision occurs, safety is evaluated based on the proportion of time steps whose TTC falls below a predefined safety threshold.

For each trajectory segment ii and time step tt, TTC is defined as

TTCit={g^itv^itfvitl,if v^itf>vitl,+,otherwise \mathrm{TTC}_{it}= \begin{cases} \dfrac{\hat{g}_{it}}{\hat{v}_{it}^{f}-v_{it}^{l}}, & \text{if } \hat{v}_{it}^{f}>v_{it}^{l},\\[8pt] +\infty, & \text{otherwise} \end{cases}

A TTC violation indicator is defined as

δit={1,if TTCit<TTCth0,otherwise \delta_{it}= \begin{cases} 1, & \text{if } \mathrm{TTC}_{it}<\mathrm{TTC}_{\mathrm{th}}\\ 0, & \text{otherwise} \end{cases}

The TTC violation ratio for trajectory segment ii is

Rviol(i)=1TitTiδit R_{\mathrm{viol}}^{(i)}= \frac{1}{|\mathcal{T}_i|} \sum_{t\in\mathcal{T}_i}\delta_{it}

The safety score is then defined as

Ss(i)={0,if g^it<0 for any tTi1Rviol(i),otherwise S_s^{(i)}= \begin{cases} 0, & \text{if } \hat{g}_{it}<0 \text{ for any } t\in\mathcal{T}_i\\[6pt] 1-R_{\mathrm{viol}}^{(i)}, & \text{otherwise} \end{cases}

Comfort

The comfort score evaluates the smoothness of the predicted FAV trajectory using longitudinal jerk.

j^itf=a^i,t+1fa^itfΔt \hat{j}_{it}^{f}= \frac{\hat{a}_{i,t+1}^{f}-\hat{a}_{it}^{f}}{\Delta t} Rj(i)=1Ti1t=1Ti1(j^itf)2 R_j^{(i)}= \sqrt{ \frac{1}{|\mathcal{T}_i|-1} \sum_{t=1}^{|\mathcal{T}_i|-1} \left(\hat{j}_{it}^{f}\right)^2 }

The comfort score is defined as

Sc(i)=max(0,  1Rj(i)Rj,th) S_c^{(i)}= \max\left(0,\;1-\frac{R_j^{(i)}}{R_{j,\mathrm{th}}}\right)

Final Score

For each trajectory segment, the total score is computed as

Stotal(i)=0.5Sa(i)+0.3Ss(i)+0.2Sc(i) S_{\mathrm{total}}^{(i)}= 0.5\,S_a^{(i)} +0.3\,S_s^{(i)} +0.2\,S_c^{(i)}

with

0Stotal(i)1 0 \le S_{\mathrm{total}}^{(i)} \le 1

The final submission score is the average total score across all evaluated trajectory segments:

Sf=1IiIStotal(i) S_f= \frac{1}{|\mathcal{I}|} \sum_{i\in\mathcal{I}} S_{\mathrm{total}}^{(i)}