Skip to content

Latest commit

 

History

History
29 lines (22 loc) · 3.93 KB

File metadata and controls

29 lines (22 loc) · 3.93 KB

The readme contains description of experiment notebooks in this repository.

Resource utilization

  • gpu-footprint.ipynb computes gpu requirements of models with different number of model parameters.

Fine tuning

  • flan-t5-3B-general-tasks: finetunes the flan t5 model for sentiment analysis task and text summarization task.
  • flan-t5-3B-RosaQA: finetunes the flan t5 model for question answering with ROSA service documentation. It then shows before and after finetuning quality of model answers.

Evaluation

  • QA_evaluation_metrics_demo.ipynb explores evaluation metrics for LLMs in Question-Answering (QA) tasks. It covers metrics for QA as well as metrics related to model complexity and human evaluation.
  • langchain-evaluation.ipynb explores how we can use specific criteria to evaluate model outputs, focussing on detailed examination within the Langchain framework.
  • retriever-evaluation.ipynb explores evaluation metrics for document retrieval techniques.

Model Serving

Feedback