Integrate vLLM Evaluator · Issue #23 · amazon-science/fmcore · GitHub

Integrate vLLM Evaluator #23

Open
@adivekar-utexas

Description

vLLM is a high-throughput LLM inference engine that runs HuggingFace models and can shard a model across GPUs in several ways, using Ray as a backend.
Even in its basic form, vLLM should be a large speedup over AccelerateEvaluator, which is quite slow.

Basic requirements:

  1. Should be compatible with RayEvaluator (and GenerativeLM if needed).
  2. Should support only models that fit on a single node; running a larger model should require a larger node rather than sharding across nodes (a design choice for better execution speed).
  3. Should integrate with all HF transformers LLMs.
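
To make the requirements above concrete, here is a minimal sketch of how a single-node vLLM wrapper might look. The names `VLLMEvaluatorConfig` and `generate` are hypothetical and are not part of fmcore; how this would plug into RayEvaluator or GenerativeLM is left open. The sketch assumes the model is sharded across the GPUs of one node through vLLM's `tensor_parallel_size` argument, and it imports vLLM lazily so the config can be built and inspected on a machine without GPUs.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class VLLMEvaluatorConfig:
    """Hypothetical config for a single-node vLLM evaluator (not an fmcore class)."""

    model_name: str          # any HuggingFace model id that vLLM supports
    num_gpus: int = 1        # GPUs on this one node to shard the model across
    max_new_tokens: int = 64
    temperature: float = 0.0

    def engine_kwargs(self) -> Dict[str, Any]:
        """Build keyword arguments for vllm.LLM, restricted to one node."""
        if self.num_gpus < 1:
            raise ValueError("num_gpus must be at least 1")
        return {
            "model": self.model_name,
            # Tensor parallelism splits the model across GPUs of this node.
            "tensor_parallel_size": self.num_gpus,
        }


def generate(config: VLLMEvaluatorConfig, prompts: List[str]) -> List[str]:
    """Run batched generation with vLLM and return one completion per prompt."""
    # Imported lazily: vLLM is only needed where generation actually runs.
    from vllm import LLM, SamplingParams

    llm = LLM(**config.engine_kwargs())
    sampling_params = SamplingParams(
        temperature=config.temperature,
        max_tokens=config.max_new_tokens,
    )
    outputs = llm.generate(prompts, sampling_params)
    # Each RequestOutput holds a list of completions; take the first one.
    return [output.outputs[0].text for output in outputs]
```

Keeping the engine arguments in a plain config object means the same settings could be handed to a Ray actor that owns the GPUs, which is one possible way to satisfy requirement 1; that part is an assumption and not something this issue specifies.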
