Customizable LLM Evaluation: Benchmarking Gemma and Beyond
This post introduces Benchmark-Gemma-Models, an open-source toolkit I developed to make evaluating Large Language Models (LLMs) more accessible and meaningful.
This post provides an overview of my Google Summer of Code 2024 research project with HumanAI, where I investigated how language and themes evolve within Dark Web communities.