2.8K views
MIT HAN Lab
TinyChat Computer running Llama2-7B Jetson Orin Nano. Key technique: AWQ 4bit quantization.
Login with Google Login with Discord