Cut off the text as soon as any stop words occur.
Weight only quantized model.
To use, you should have the intel-extension-for-transformers packabge and
transformers package installed.
intel-extension-for-transformers:
https://github.com/intel/intel-extension-for-transformers