Pull requests: intel/xFasterTransformer

Pull requests list

[Dependency] Pin python requirements.txt version.
#458 opened Jun 25, 2024 by Duyi-Wang
Fixed punctuation error in README
#444 opened Jun 10, 2024 by denniszhen1
[Layers] Increased the threshold for enabling flashAttn. (labels: performance)
#428 opened Jun 3, 2024 by abenmao
[Kernel] Add dynamic onednn matmul. (labels: performance)
#425 opened May 28, 2024 by changqi1 Draft
[Model] Achieve whole pipeline parallel. (labels: enhancement, gpu)
#355 opened Apr 28, 2024 by changqi1 Draft
[Eval] Add eval test with opencompass. (labels: benchmark, enhancement)
#325 opened Apr 17, 2024 by marvin-Yu Draft
Update AWQ GPTQ quantization guide (labels: documentation)
#306 opened Apr 10, 2024 by miaojinc