Wenqi Glantz in Towards Data Science · "The Journey of RAG Development: From Notebook to Microservices" · Converting a Colab notebook to two microservices with support for Milvus and NeMo Guardrails · 11 min read · Feb 21, 2024
Wenqi Glantz in Towards Data Science · "NeMo Guardrails, the Ultimate Open-Source LLM Security Toolkit" · Exploring NeMo Guardrails’ practical use cases · 13 min read · Feb 9, 2024
Wenqi Glantz in Towards Data Science · "12 RAG Pain Points and Proposed Solutions" · Solving the core challenges of Retrieval-Augmented Generation · 18 min read · Jan 30, 2024
Wenqi Glantz in Towards Data Science · "Jump-start Your RAG Pipelines with Advanced Retrieval LlamaPacks and Benchmark with Lighthouz AI" · Exploring robust RAG development with LlamaPacks, Lighthouz AI, and Llama Guard · 12 min read · Jan 29, 2024
Wenqi Glantz in Towards Data Science · "Exploring mergekit for Model Merge, AutoEval for Model Evaluation, and DPO for Model Fine-tuning" · My observations from experimenting with model merge, evaluation, and two model fine-tuning techniques · 14 min read · Jan 19, 2024
Wenqi Glantz in Towards Data Science · "Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference" · A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex · 15 min read · Jan 15, 2024
Wenqi Glantz in Towards Data Science · "Deploying LLM Apps to AWS, the Open-Source Self-Service Way" · A step-by-step guide on deploying LlamaIndex RAGs to AWS ECS fargate · 12 min read · Jan 8, 2024
Wenqi Glantz in Towards Data Science · "Safeguarding Your RAG Pipelines: A Step-by-Step Guide to Implementing Llama Guard with LlamaIndex" · How to add Llama Guard to your RAG pipelines to moderate LLM inputs and outputs and combat prompt injection · 15 min read · Dec 27, 2023
Wenqi Glantz in Level Up Coding · "10+ Ways to Run Open-Source Models with LlamaIndex" · LlamaIndex’s open-source model integration with Hugging Face, vLLM, Ollama, Llama.cpp, liteLLM, Replicate, Gradient, and more · 18 min read · Dec 19, 2023