[ACL 2024] Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
Official Repo of LangSuitE
Official implementation of the ACL 2024 paper "Scientific Inspiration Machines Optimized for Novelty"
[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)
🧙🏻 Code and benchmark for our Findings of ACL 2024 paper "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models"
Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances (ACL 2024)
RAID is the largest and most challenging benchmark for machine-generated text detectors. (ACL 2024)
Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL2024 (findings)
Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024
[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models
From Zero to Hero: Cold-Start Anomaly Detection (ACL 2024)
Code for ACL Shared Task 10, tracks 1, 2, and 3; achieved ranks of 2 and 7 in tracks 2 and 3, respectively
Repository for the ACL 2024 paper "LIEDER: Linguistically-Informed Evaluation Suite for Discourse Entity Recognition"