feat: flashv2 #94
base: main
Conversation
The performance increase is probably small because we use custom attention masks.
we'd also probably wanna add a corresponding branch to https://github.com/yaak-ai/carGPT/blob/main/cargpt/components/llm.py#L229 (for attention viz at inference time)
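For reference, a minimal sketch of what such a branch could look like. The function name, the signature, and the use of `F.scaled_dot_product_attention` on the fast path are illustrative assumptions, not the actual code at llm.py#L229:

```python
import torch.nn.functional as F


def attention(q, k, v, *, attn_mask=None, return_weights=False):
    """Hypothetical dispatch; q, k, v are (batch, heads, seq, head_dim)."""
    if return_weights:
        # Viz path: materialize the attention matrix so it can be inspected/logged.
        scores = (q @ k.transpose(-2, -1)) / q.shape[-1] ** 0.5
        if attn_mask is not None:
            # Boolean mask: True = position may be attended to.
            scores = scores.masked_fill(~attn_mask, float("-inf"))
        weights = scores.softmax(dim=-1)
        return weights @ v, weights

    # Fast path: fused kernel; attention weights are never materialized.
    return F.scaled_dot_product_attention(q, k, v, attn_mask=attn_mask), None
```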
cargpt/components/__init__.py
Outdated
@@ -3,6 +3,8 @@
 from tensordict.tensorclass import _eq, _getitem  # noqa: PLC2701
 from tensordict.utils import IndexType

+from cargpt.utils.attention import MemoryEfficientScaledDotProduct  # noqa #type: ignore
do we actually need this exported here?
We need to import it somewhere, and __init__.py seems like the best choice.
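For context, a sketch of the kind of module this import could refer to. The branch name (xformers-flashv2) suggests a wrapper around `xformers.ops.memory_efficient_attention`, but the class internals are not shown in this diff, so everything below is an assumption:

```python
# Sketch only — the real MemoryEfficientScaledDotProduct lives in
# cargpt/utils/attention.py and its internals are not part of this diff.
from torch import nn
import xformers.ops as xops


class MemoryEfficientScaledDotProduct(nn.Module):
    """Hypothetical wrapper around xformers' fused attention kernels."""

    def __init__(self, dropout_p: float = 0.0) -> None:
        super().__init__()
        self.dropout_p = dropout_p

    def forward(self, q, k, v, attn_bias=None):
        # Expected layout for xformers: (batch, seq_len, num_heads, head_dim).
        # attn_bias may be an AttentionBias (e.g. xops.LowerTriangularMask())
        # or a dense tensor; dense custom masks can route to a slower kernel,
        # which would fit the "performance increase is probably small" remark above.
        return xops.memory_efficient_attention(
            q, k, v, attn_bias=attn_bias, p=self.dropout_p if self.training else 0.0
        )
```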
So let's just merge it without the viz feature. It can still be added for full-scale training once the experimenting stage is completed (never).
Add flash attention v2
Basically, just extracted flash specific changes from https://github.com/yaak-ai/carGPT/tree/xformers-flashv2
Tested it, and it showed a running-speed improvement: the mean duration of a train/val step (they are pretty similar) dropped by ~15% (1.9 s -> 1.6 s).
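A rough way to sanity-check the attention-level speedup in isolation (assumes a CUDA GPU with xformers installed; this times only the attention op, so the numbers won't match the full train/val step duration quoted above):

```python
import time

import torch
import xformers.ops as xops

B, S, H, D = 8, 1024, 8, 64
q = torch.randn(B, S, H, D, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)


def bench(fn, iters=50):
    for _ in range(5):  # warmup
        fn()
    torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    torch.cuda.synchronize()
    return (time.perf_counter() - start) / iters


def naive():
    # Materializes the full (B, H, S, S) attention matrix.
    scores = (q.transpose(1, 2) @ k.transpose(1, 2).transpose(-2, -1)) / D**0.5
    return scores.softmax(dim=-1) @ v.transpose(1, 2)


def fused():
    # Memory-efficient / flash path; xformers expects (B, S, H, D) layout.
    return xops.memory_efficient_attention(q, k, v)


print(f"naive: {bench(naive) * 1e3:.2f} ms, fused: {bench(fused) * 1e3:.2f} ms")
```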