Skip to content

Commit 628c965

Browse files
committed
Update wayformer
1 parent 9b3789c commit 628c965

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

paper_notes/wayformer.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -26,6 +26,7 @@ There are many facets of motion forecasting (behavior prediction). This can be c
2626
- There are two distinctive concepts both denoted as multi-modality in motion forecasting. This also makes modeling for scene understanding challenging. --> Both can be translated as 多模态 in Chinese, which is also unfortunately inconvenient.
2727
- Input multimodality: the input to motion forecasting network is **heterogeneous**, such as road geometry, lane connectivity, time-varying traffic light state, and history of a dynamic set of agents and their interactions.
2828
- Output multimodality: the output of motion forecasting need to be **diverse** to reflect many possible underlying intents of the traffic agents, such as going straight, turning right, or making u-turns.
29+
- Output follows [multipath](multipath.md) and [multipath++](multipath++.md), regression based.
2930
- Overall structure
3031
- The scene encoder is one or more attention encoders that summarize the driving space.
3132
- The decoder is a stack of standard transformer cross attention blocks, learned initial queries are fed in, and then cross-attended with the scene encoding to produce trajectories.

0 commit comments

Comments
 (0)