Wenqi Glantz – Medium

Pinned

Wenqi Glantz

An Overview of My Blog Posts

Learning by Blogging

4 min readJul 26, 2023

--

An Overview of My Blog Posts

--

Wenqi Glantz
in
Towards Data Science

The Journey of RAG Development: From Notebook to Microservices

Converting a Colab notebook to two microservices with support for Milvus and NeMo Guardrails

11 min readFeb 21, 2024

--

3

The Journey of RAG Development: From Notebook to Microservices

--

3

Wenqi Glantz
in
Towards Data Science

NeMo Guardrails, the Ultimate Open-Source LLM Security Toolkit

Exploring NeMo Guardrails’ practical use cases

13 min readFeb 9, 2024

--

NeMo Guardrails, the Ultimate Open-Source LLM Security Toolkit

--

Wenqi Glantz
in
Towards Data Science

12 RAG Pain Points and Proposed Solutions

Solving the core challenges of Retrieval-Augmented Generation

18 min readJan 30, 2024

--

10

12 RAG Pain Points and Proposed Solutions

--

10

Wenqi Glantz
in
Towards Data Science

Jump-start Your RAG Pipelines with Advanced Retrieval LlamaPacks and Benchmark with Lighthouz AI

Exploring robust RAG development with LlamaPacks, Lighthouz AI, and Llama Guard

12 min readJan 29, 2024

--

2

Jump-start Your RAG Pipelines with Advanced Retrieval LlamaPacks and Benchmark with Lighthouz AI

--

2

Wenqi Glantz
in
Towards Data Science

Exploring mergekit for Model Merge, AutoEval for Model Evaluation, and DPO for Model Fine-tuning

My observations from experimenting with model merge, evaluation, and two model fine-tuning techniques

14 min readJan 19, 2024

--

3

Exploring mergekit for Model Merge, AutoEval for Model Evaluation, and DPO for Model Fine-tuning

--

3

Wenqi Glantz
in
Towards Data Science

Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference

A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex

15 min readJan 15, 2024

--

2

Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference

--

2

Wenqi Glantz
in
Towards Data Science

Deploying LLM Apps to AWS, the Open-Source Self-Service Way

A step-by-step guide on deploying LlamaIndex RAGs to AWS ECS fargate

12 min readJan 8, 2024

--

3

Deploying LLM Apps to AWS, the Open-Source Self-Service Way

--

3

Wenqi Glantz
in
Towards Data Science

Safeguarding Your RAG Pipelines: A Step-by-Step Guide to Implementing Llama Guard with LlamaIndex

How to add Llama Guard to your RAG pipelines to moderate LLM inputs and outputs and combat prompt injection

15 min readDec 27, 2023

--

2

Safeguarding Your RAG Pipelines: A Step-by-Step Guide to Implementing Llama Guard with LlamaIndex

--

2

Wenqi Glantz
in
Level Up Coding

10+ Ways to Run Open-Source Models with LlamaIndex

LlamaIndex’s open-source model integration with Hugging Face, vLLM, Ollama, Llama.cpp, liteLLM, Replicate, Gradient, and more

18 min readDec 19, 2023

--

3

10+ Ways to Run Open-Source Models with LlamaIndex

--

3

Wenqi Glantz

Wenqi Glantz

Mom, wife, architect with a passion for technology and crafting quality products linkedin.com/in/wenqi-glantz-b5448a5a/ twitter.com/wenqi_glantz

Following

Help
Status
About
Careers
Blog
Privacy
Terms
Text to speech
Teams