AI Labs: Debugging and Repairing TensorRT Inference. Tracing and fixing a TensorRT FP16 inference bug after fine-tuning a classification model. A full walkthrough: diagnosing ONNX input issues, rebuilding pipelines, validating logits and softmax outputs, and benchmarking model size and speed.
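The logits-and-softmax validation step can be sketched as a reference check against the full-precision model (the helper names here are hypothetical; the lab's own pipeline may structure this differently):

```python
import numpy as np

def softmax(logits: np.ndarray) -> np.ndarray:
    """Numerically stable softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)

def validate_fp16_logits(fp32_logits, fp16_logits, atol=1e-2):
    """Compare FP16 logits to the FP32 reference: probabilities
    should be close and the predicted class must not change."""
    p32 = softmax(np.asarray(fp32_logits, dtype=np.float32))
    p16 = softmax(np.asarray(fp16_logits, dtype=np.float32))
    same_class = np.array_equal(p32.argmax(-1), p16.argmax(-1))
    close = np.allclose(p32, p16, atol=atol)
    return same_class, close

# Simulated logits: an FP16 round-trip introduces small rounding noise
fp32 = np.array([[2.0, 0.5, -1.0], [0.1, 3.2, 0.3]])
fp16 = fp32.astype(np.float16).astype(np.float32)
print(validate_fp16_logits(fp32, fp16))  # (True, True)
```

A check like this is what separates "the engine runs" from "the engine is correct": matching argmax catches silent class flips, while the probability tolerance catches broader numerical drift.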
AI Labs: Quantisation in Deep Learning: A Practical Lab Guide. Explore INT8 and FP16 quantisation of a LoRA-fine-tuned BERT across PyTorch, ONNX Runtime, and TensorRT. Compare model size, inference latency, and nine-way F1 to discover which workflow best balances accuracy and performance for CPU and GPU deployments.
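The core INT8 idea the lab compares can be sketched in plain NumPy (a toy symmetric per-tensor scheme, not the calibrated schemes ONNX Runtime or TensorRT actually apply):

```python
import numpy as np

def quantize_int8(w: np.ndarray):
    """Symmetric per-tensor INT8 quantisation: map floats to [-127, 127]."""
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float weights from INT8 values."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=(4, 4)).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

print(w.nbytes // q.nbytes)                    # 4x smaller storage
print(float(np.abs(w - w_hat).max()) < scale)  # rounding error bounded by one step
```

The 4x storage saving comes for free; the interesting part, which the lab measures, is whether the bounded rounding error survives a full forward pass without moving the nine-way F1.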
AI Labs: Fine-Tuning a Model with LoRA. Fine-tune BERT with LoRA adapters for custom category and headline classification. Walk through adapter integration, the training loop, and evaluation, preparing your model for efficient quantisation.
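The adapter mechanism itself can be sketched outside any framework (a toy NumPy version of the low-rank update, not any particular library's implementation):

```python
import numpy as np

rng = np.random.default_rng(42)
d_in, d_out, r, alpha = 8, 8, 2, 16

W = rng.normal(size=(d_out, d_in))     # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01  # trainable down-projection
B = np.zeros((d_out, r))               # trainable up-projection, zero-initialised

def lora_forward(x, W, A, B, alpha, r):
    """y = W x + (alpha / r) * B A x: the frozen layer plus a rank-r update."""
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d_in)
# With B zero-initialised, the adapter starts as an exact no-op:
print(np.allclose(lora_forward(x, W, A, B, alpha, r), W @ x))  # True
```

Only A and B (2 * r * d values) are trained, which is why LoRA checkpoints stay tiny and why the frozen base model merges cleanly with the adapter before quantisation.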
AI Labs: Machine Learning: Building Our First Classifier with Scikit-Learn. Train your first ML model using Scikit-Learn and the classic Iris dataset. Learn how KNN works, evaluate model accuracy with a confusion matrix and classification report, and understand what features like petal length reveal about species prediction.
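The whole workflow this lab walks through fits in a few lines of Scikit-Learn (a minimal sketch; the split ratio and k=5 here are illustrative choices, not necessarily the lab's):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score, classification_report

# The classic Iris dataset: 150 flowers, 4 features, 3 species
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y
)

# k-nearest neighbours: classify by majority vote of the 5 closest samples
clf = KNeighborsClassifier(n_neighbors=5)
clf.fit(X_train, y_train)

y_pred = clf.predict(X_test)
print(f"accuracy: {accuracy_score(y_test, y_pred):.2f}")
print(classification_report(y_test, y_pred,
                            target_names=load_iris().target_names))
```

The classification report breaks accuracy down per species, which is where features like petal length show their value: the species they separate cleanly score near-perfect precision and recall.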