Large Language Model Routing with Benchmark Datasets
There is a rapidly growing number of open-source Large Language Models (LLMs) and benchmark datasets to compare them. While some models dominate these benchmarks, no single model typically achieves the best accuracy in all tasks and use cases. With a new dataset, it can be difficult to determine whi...
Main Author: | Ou, Anthony C. |
---|---|
Other Authors: | Thompson, Neil |
Format: | Thesis |
Published: | Massachusetts Institute of Technology, 2024 |
Online Access: | https://hdl.handle.net/1721.1/153846 |
Similar Items
- Waste collection vehicle routing problem benchmark datasets and case studies: A review
  by: Idrus, Zanariah, et al.
  Published: (2017)
- e-ViL: A dataset and benchmark for natural language explanations in vision-language tasks
  by: Kayser, M, et al.
  Published: (2022)
- A benchmark comparison of perceptual models for soundscapes on a large-scale augmented soundscape dataset
  by: Ooi, Kenneth, et al.
  Published: (2023)
- Tidal Benchmarking Project Dataset: R001
  by: Harvey, S, et al.
  Published: (2022)
- Using The Barton Libraries Dataset As An RDF benchmark
  by: Abadi, Daniel J., et al.
  Published: (2007)