Open in app

Sign in

Write

Sign in

Andrew Zhu
Andrew Zhu

802 Followers

Home

About

Pinned

Articles Catalog

A content list for easy navigation — Machine Learning & Data Analysis You ask the questions, GPT digs the insights Understand Transformer Attention In Plain Language Run AI Models in CUDA-Enabled WSL2 Install CUDA for Ubuntu in 5 mins Build A Capable Machine For LLM and AI Performance Testing Note of Diffusers With PyTorch 2.0 Improving Diffusers Package for High-Quality Image Generation …

Catalog

2 min read

Catalog

2 min read


Published in

Stackademic

·Nov 21

Load up and Run any 4-bit LLM models using Huggingface Transformers

Solve the 4-bit LLM setup problems all at one time — I was trying to run open-source LLM using the Huggingface Transformers model in 4-bit quantization mode. Why 4-bit quantization? because I can only run a 4-bit 30B LLM model in a GPU with 24G VRAM. Read several articles such as Making LLMs even more accessible with bitsandbytes, 4-bit quantization and…

Llm

4 min read

Llm

4 min read


Published in

Towards Data Science

·Jul 28

Hugging Face Diffusers can correctly load LoRA

Using the Latest Diffusers Monkey Patching function to load LoRA produces exactly the same result compare with A1111 — Pull the latest code from Hugging Face’s Diffusers code repository, and found that the newest code updated related to LoRA loading is updated and can do Monkey-Patching LoRA loading now. To install the latest Diffusers: pip install -U git+https://github.com/huggingface/diffusers.git@main

Stable Diffusion

5 min read

Hugging Face Diffusers can correctly load LoRA
Hugging Face Diffusers can correctly load LoRA
Stable Diffusion

5 min read


Published in

Data Science at Microsoft

·Jul 18

You ask the questions, GPT digs the insights

A Gen AI approach to business data analysis — By Andrew Zhu and Jacob H. Marquez The “Chat” in ChatGPT conveys that ChatGPT can provide information as a sort of dialog — albeit with an information cut date of September 2021 that reflects the temporal limit of ChatGPT’s awareness. But ChatGPT goes beyond being merely a powerful tool for…

AI

17 min read

You ask the questions, GPT digs the insights
You ask the questions, GPT digs the insights
AI

17 min read


Published in

CodeX

·Jul 15

Understand Transformer Attention In Plain Language

Explain the Attention mechanism with a concrete sample — The Transformer machine learning model brought by the paper: Attention Is All You Need [1], completely transformed the machine learning area. And the Attention mechanism function as the core of the Transform model. The original paper throws us a formula like this without too much explanation: This article is trying…

Transformers

4 min read

Understand Transformer Attention In Plain Language
Understand Transformer Attention In Plain Language
Transformers

4 min read


Published in

CodeX

·Jun 15

Run AI Models in CUDA-Enabled WSL2

Setup a CUDA-enabled WSL2 in Windows for LLM and Stable Diffusion models without performance sacrifice — There will be a time you need to run your models in a Linux environment. However, your most powerful GPU is running a Windows, and you don’t want to replace the whole system with Linux, and those oh, those games. WSL2 is a good choice that balances the gap between…

Llm

4 min read

Run AI Models in CUDA-Enabled WSL2
Run AI Models in CUDA-Enabled WSL2
Llm

4 min read


May 14

Install CUDA for Ubuntu in 5 mins

One tweet from https://twitter.com/finbarrtimbers says: I enjoyed the CUDA driver installation for 2 days plus another day to reinstall the whole system around 2 years ago. Now in 2023, with the correct steps, you can only enjoy CUDA drivers installation for just 5 mins in Ubuntu, here are the steps: …

Cuda

2 min read

Install CUDA for Ubuntu in 5 mins
Install CUDA for Ubuntu in 5 mins
Cuda

2 min read


Published in

CodeX

·May 12

Build A Capable Machine For LLM and AI

Build A Dual GPUs PC for Machine Learning and AI with Minimum cost — Background and Building Target Both Stable Diffusion and offline LLM models require a huge amount of RAM and VRAM. To run and learn those models, I bought an RTX 3090 for its 24G VRAM. Actually, my aging Intel i7–6700k can still work well with a single RTX 3090, but when I throw another GPU…

Machine Learning

10 min read

Build A Capable Machine For LLM and AI
Build A Capable Machine For LLM and AI
Machine Learning

10 min read


Published in

Better Programming

·Apr 16

Performance Testing Note of Diffusers With PyTorch 2.0

Test various methods to boost Stable Diffusion package Diffusers' performance and lower VRAM usage — This is a testing note of running Diffusers with the latest PyTorch 2.0⁴. The purpose is to test out if all those optimization methods work or not if they work, how much performance improved. This article is going to cover the following performance & optimization methods: What is the baseline…

Stable Diffusion

7 min read

Performance Testing Note of Diffusers With PyTorch 2.0
Performance Testing Note of Diffusers With PyTorch 2.0
Stable Diffusion

7 min read


Published in

Towards Data Science

·Apr 5

Improving Diffusers Package for High-Quality Image Generation

Overcoming token size limitations, custom model loading, LoRa support, textual inversion support, and more — Stable Diffusion WebUI from AUTOMATIC1111 has proven to be a powerful tool for generating high-quality images using the Diffusion model. However, while the WebUI is easy to use, data scientists, machine learning engineers, and researchers often require more control over the image generation process. …

Stable Diffusion

14 min read

Improving Diffusers Package for High-Quality Image Generation
Improving Diffusers Package for High-Quality Image Generation
Stable Diffusion

14 min read

Andrew Zhu

Andrew Zhu

802 Followers

Data Scientist@MS | Automate everything | https://xhinker.medium.com/membership | https://www.linkedin.com/in/andrew-zhu-23407223/ | https://twitter.com/xhinker

Following
  • Leon Eversberg, M. Sc.

    Leon Eversberg, M. Sc.

  • Netflix Technology Blog

    Netflix Technology Blog

  • Deepsha Menghani

    Deepsha Menghani

  • Tobias van Schneider

    Tobias van Schneider

  • Saptarshi Chaudhuri

    Saptarshi Chaudhuri

See all (80)

Help

Status

About

Careers

Blog

Privacy

Terms

Text to speech

Teams