Script request

#1
by singhsidhukuldeep - opened

Hello @TechxGenus
Thanks for the quantisation 🙂
Can you please share the script that you used to do it?
And if possible the hardware configuration?!

Also have you calculated the perplexity?

I used the default AutoAWQ script, with 2 RTX 4090s.
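
For readers looking for the script itself: the thread does not include it, so the following is only a sketch modelled on the basic quantization example in the AutoAWQ README, not the exact script used here. The function name `quantize_with_autoawq` and its two path parameters are placeholders, and the config values are the README defaults as far as I know; the model and settings actually used are not stated in this thread.

```python
# Sketch based on the AutoAWQ README example; the actual model path and
# settings used for this repository are NOT stated in the thread.
QUANT_CONFIG = {
    "zero_point": True,   # asymmetric quantization with zero points
    "q_group_size": 128,  # weights share a scale per group of 128
    "w_bit": 4,           # 4-bit weights
    "version": "GEMM",    # kernel variant used at inference time
}


def quantize_with_autoawq(model_path: str, quant_path: str) -> None:
    """Quantize a Hugging Face causal LM with AutoAWQ and save it.

    Both arguments are placeholders: `model_path` is the source model
    (hub id or local directory), `quant_path` is the output directory.
    """
    # Imported lazily so this file can be loaded without AutoAWQ installed.
    from awq import AutoAWQForCausalLM
    from transformers import AutoTokenizer

    model = AutoAWQForCausalLM.from_pretrained(model_path)
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

    # Runs AWQ calibration and quantizes the model weights.
    model.quantize(tokenizer, quant_config=QUANT_CONFIG)

    # Write the quantized weights and the tokenizer side by side.
    model.save_quantized(quant_path)
    tokenizer.save_pretrained(quant_path)
```

Calling `quantize_with_autoawq("<source model>", "<output dir>")` needs a CUDA-capable machine with enough memory for the unquantized model.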
PPL: 2.892
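
For context on the PPL figure: perplexity is the exponential of the average negative log-likelihood per token, so lower is better. The thread does not say which dataset or context length the number was measured on. A minimal sketch of the formula, assuming you already have per-token natural-log probabilities from a model (the helper name `perplexity` is just illustrative):

```python
import math


def perplexity(token_log_probs):
    """Return exp(mean negative log-likelihood) over the given tokens.

    `token_log_probs` is a sequence of natural-log probabilities that the
    model assigned to each observed token.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)
```

As a sanity check, a model that assigns probability 0.5 to every token has a perplexity of 2, which is what `perplexity([math.log(0.5)] * 4)` evaluates to up to floating-point rounding.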

Hey @TechxGenus
Thanks for the info.

Is this your local setup, or are you using a cloud provider?

It is my local setup.
