- Secure Inference for Diffusion Models via Unconditional Scores
I find the paper’s topic of efficient secure inference for diffusion models interesting, its proposed technique clever and its writing of high quality The empirical work is clean and appears reproducible
- CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human . . .
Abstract Mastering fine-grained visual recognition, essential in many expert domains, can re-quire that specialists undergo years of dedicated training Modeling the progression of such expertize in humans remains challenging, and accurately inferring a human learner’s knowledge state is a key step toward understanding visual learning We introduce CleverBirds, a large-scale knowledge
- Jonathan Gratch - OpenReview
ACII 2021 CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems Kushal Chawla, Jaysa Ramirez, Rene Clever, Gale M Lucas, Jonathan May, Jonathan Gratch 2021 (modified: 04 Jan 2022) NAACL-HLT 2021 Towards Emotion-Aware Agents For Negotiation Dialogues Kushal Chawla, Rene Clever, Jaysa Ramirez, Gale M Lucas
- Language Models as Implicit Tree Search | OpenReview
The second AI acts like a clever "thinking coach," guiding the first one to explore ideas and find smart solutions, similar to how a chess AI master plans moves, but without the usual complex training steps This teamwork means AI can become both a better listener (understanding our preferences) and a sharper thinker (solving difficult problems)
- TRANSFORMERS CAN NAVIGATE MAZES WITH MULTI-STEP PREDICTION
en prediction objectives for basic graph navigation tasks In particular, 114 the work identifies a Clever-Hans cheat based on shortcuts in teacher forced training similar to theo- 15 retical shortcomings identified in Wang et al (2024b) This demonstrates that while transformers can 116 represent world states for mazes, they ma
- On the Planning Abilities of Large Language Models : A Critical . . .
While, as we mentioned earlier, there can be thorny “clever hans” issues about humans prompting LLMs, an automated verifier mechanically backprompting the LLM doesn’t suffer from these We tested this setup on a subset of the failed instances in the one-shot natural language prompt configuration using GPT-4, given its larger context window
- VideoChat-Flash: Hierarchical Compression for Long-Context Video. . .
Long-context video modeling is critical for multimodal large language models (MLLMs), enabling them to process movies, online video streams, and so on Despite its advances, handling long videos
- Counterfactual Debiasing for Fact Verification
579 In this paper, we have proposed a novel counter- factual framework CLEVER for debiasing fact- checking models Unlike existing works, CLEVER is augmentation-free and mitigates biases on infer- ence stage In CLEVER, the claim-evidence fusion model and the claim-only model are independently trained to capture the corresponding information
|