Allow users to retrieve attention scores when using vLLM.
Thanks for the excellent work! I wonder can we get the attention scores when using vLLM?