
dblp: In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization.

Ruiqi Zhang, Jingfeng Wu, Peter L. Bartlett (2024)

Details and statistics

DOI:

access: open

type: Conference or Workshop Paper

metadata version: 2025-02-13









Fetched URL: https://dblp.org/rec/conf/nips/ZhangWB24
