1 resource

  • S Christie, Baptiste Moreau-Pernet, Yu T...
    |
    Jul 24th, 2024
    |
    conferencePaper
    S Christie, Baptiste Moreau-Pernet, Yu T...
    Jul 24th, 2024

    Large language models (LLMs) are increasingly being deployed in user-facing applications in educational settings. Deployed applications often augment LLMs with fine-tuning, custom system prompts, and moderation layers to achieve particular goals. However, the behaviors of LLM-powered systems are difficult to guarantee, and most existing evaluations focus instead on the performance of unmodified 'foun-dation' models. Tools for evaluating such deployed systems are currently sparse, inflexible,...

Last update from database: 26/12/2024, 23:15 (UTC)
Powered by Zotero and Kerko.