InfiniTensor/examples/distributed
Latest commit: Bang cncl (#163) by Hardy (42032356fb)
* MLU CNCL base

* add FindCNCL.cmake; -lcncl not found

* bangPrintFloat not found

* docker: make successful, test error

* delete net file and onnxtest.py

* init

* fix cncl

* format

* fix

* format

* fix cncl

* run dist gpt2 on mlu

* format

* fix import error on mlu docker

* run llama single card

* run distributed llama2

* add test for slice/reduce on mlu

* fix cncl related test

* fix format

* format

* delete comments

* change GPU to MLU

* modify launch script

* fix name

* fix format

* fix gather

* format python script

---------

Co-authored-by: xgqdut2016 <kenan_gewei@163.com>
Co-authored-by: Bolun <chamberlain0w0@gmail.com>
Co-authored-by: Bolun Zhang <48948016+Chamberlain0w0@users.noreply.github.com>
2024-01-03 13:28:03 +08:00
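
The commits above add CNCL-backed multi-card runs of GPT-2 and LLaMA-2 on MLU via a per-rank launch script. As a rough orientation only, here is a minimal Python sketch of that launch pattern, assuming a hypothetical run_rank worker and a queue-based sum that stands in for the library's all-reduce; it is not bang_launch.py and does not use the CNCL/NCCL API.

import multiprocessing as mp

import numpy as np


def run_rank(rank, world_size, queue):
    # In a real launcher each rank would bind to its MLU/GPU, build its model
    # shard, and run inference; here it just emits a toy partial result.
    partial = np.full(4, float(rank), dtype=np.float32)
    queue.put((rank, partial))


def launch(world_size=2):
    queue = mp.Queue()
    workers = [mp.Process(target=run_rank, args=(r, world_size, queue))
               for r in range(world_size)]
    for w in workers:
        w.start()
    # Stand-in for an all-reduce: sum the partial result from every rank.
    reduced = sum(queue.get()[1] for _ in range(world_size))
    for w in workers:
        w.join()
    return reduced


if __name__ == "__main__":
    print(launch(world_size=2))  # ranks 0 and 1 -> [1. 1. 1. 1.]
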
bang_launch.py Bang cncl (#163) 2024-01-03 13:28:03 +08:00
launch.py fix tensor parallel for llama (#159) 2023-10-30 15:04:16 +08:00
launch_kvcache.py Support kvcache (#134) 2023-09-18 14:17:02 +08:00
parallel.py impl distributed launch with NCCL (#106) 2023-09-05 09:47:35 +08:00
parallel_opt.py Bang cncl (#163) 2024-01-03 13:28:03 +08:00
placement.py tensor parallel for transformer (#125) 2023-09-14 14:19:45 +08:00
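
parallel_opt.py and placement.py concern tensor-parallel partitioning of transformer layers (#125, #159). A minimal NumPy sketch of the underlying idea, assuming a simple column-wise split of one linear layer's weight across two ranks; the shard and gather steps here are illustrative only, not the project's actual passes.

import numpy as np

rng = np.random.default_rng(0)
world_size = 2
x = rng.standard_normal((3, 8)).astype(np.float32)   # activations
w = rng.standard_normal((8, 16)).astype(np.float32)  # full linear-layer weight

# Column-wise shard: each rank keeps one slice of the weight.
shards = np.split(w, world_size, axis=1)

# Each rank multiplies the (replicated) activations by its own slice only.
partials = [x @ w_shard for w_shard in shards]

# Gather step: concatenating the per-rank outputs recovers the full matmul.
gathered = np.concatenate(partials, axis=1)
assert np.allclose(gathered, x @ w, atol=1e-5)
print(gathered.shape)  # (3, 16)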