PySpect

Home

Invoices

search

top

Package search

trl

  • Summary: Train transformer language models with reinforcement learning.
  • Author: Leandro von Werra
  • Homepage: https://github.com/huggingface/trl
  • Source: https://github.com/huggingface/trl
  • Repo profile
  • Number of releases: 58
  • First release: 0.0.1 on 2020-03-30T16:50:54
  • Latest release: 0.17.0 on 2025-04-24T23:18:40
Dependencies of trl (25).
DependencyOptional
acceleratefalse
datasetsfalse
richfalse
transformersfalse
bitsandbytestrue
deepspeedtrue
diffuserstrue
fastapitrue
joblibtrue
liger-kerneltrue
llm-blendertrue
mergekittrue
openaitrue
parameterizedtrue
pefttrue
Pillowtrue
pydantictrue
pytesttrue
pytest-covtrue
pytest-rerunfailurestrue
pytest-xdisttrue
requeststrue
scikit-learntrue
uvicorntrue
vllmtrue