Compact guide to run Huggingface Transformers with local LLaMA style models using Python with practical installation and inference tips.