#2 arXiv Weekly Insights

Special offer: 30% off SmartXiv for 24 hours

Welcome to "arXiv Weekly Insights", where we delve into the latest groundbreaking research and developments from the arXiv repository.

This newsletter is brought to you by SmartXiv, the AI-powered Personalized arXiv Daily Digest designed to enhance your research experience. With over 1000 research papers uploaded daily on arXiv, it's easy to miss important updates. Let SmartXiv deliver personalized recommendations so you never miss what truly matters to you.
Get started today and save 30% with your annual subscription.

Computation and Language
Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models
Zhuo Chen, Jiawei Liu, Haotan Liu, Qikai Cheng, Fan Zhang, Wei Lu, Xiaozhong Liu

This paper explores the vulnerabilities of Retrieval-Augmented Generation (RAG) models when faced with black-box attacks for opinion manipulation. The proposed attack strategy can significantly alter the opinion polarity of the content generated by RAG, demonstrating the model's vulnerability and the potential negative impact on user cognition and decision-making.

Computational Finance
Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management
Yoontae Hwang, Stefan Zohren, Yongjae Lee

This paper introduces SimStock, a novel temporal self-supervised learning framework that combines techniques from self-supervised learning (SSL) and temporal domain generalization to learn robust and informative representations of financial time series data. The study aims to understand the similarities between stocks from a broader perspective, considering the complex dynamics of the global financial landscape. The effectiveness of SimStock is demonstrated through its application to various investment strategies, such as pairs trading, index tracking, and portfolio optimization. The paper examines the potential of data-driven approaches to enhance investment decision-making and risk management practices by leveraging temporal self-supervised learning in the face of the ever-changing global financial landscape.

Computation and Language
LLMs as Function Approximators: Terminology, Taxonomy, and Questions for Evaluation
David Schlangen

This paper proposes a framework for seeing the generality of large language models (LLMs) in their ability to approximate specialist functions, based on a natural language specification. This framing brings to the fore questions of the quality of the approximation and, beyond that, questions of discoverability, stability, and protectability of these functions.

Machine Learning
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review
Masatoshi Uehara, Yulai Zhao, Tommaso Biancalani, Sergey Levine

This tutorial provides a comprehensive survey of methods for fine-tuning diffusion models to optimize downstream reward functions. The tutorial explains the application of various reinforcement learning (RL) algorithms, including PPO, differentiable optimization, reward-weighted MLE, value-weighted sampling, and path consistency learning, tailored specifically for fine-tuning diffusion models. The tutorial aims to explore fundamental aspects such as the strengths and limitations of different RL-based fine-tuning algorithms across various scenarios, the benefits of RL-based fine-tuning compared to non-RL-based approaches, and the formal objectives of RL-based fine-tuning (target distributions).
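
To make one of the listed techniques concrete, here is a minimal toy sketch of the idea behind reward-weighted MLE: samples with higher reward contribute more to the likelihood objective. This is not taken from the tutorial. The function names, the temperature parameter `alpha`, and the use of plain per-sample log-likelihoods (a real diffusion model would typically use a denoising loss instead) are all assumptions made for illustration.

```python
import math


def reward_weights(rewards, alpha=1.0):
    """Return normalized weights proportional to exp(reward / alpha).

    `alpha` is a hypothetical temperature: smaller values concentrate
    the weight on the highest-reward samples.
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    shift = max(rewards)  # subtract the max for numerical stability
    exps = [math.exp((r - shift) / alpha) for r in rewards]
    total = sum(exps)
    return [e / total for e in exps]


def reward_weighted_nll(log_likelihoods, rewards, alpha=1.0):
    """Weighted negative log-likelihood over a batch of samples.

    Minimizing this pushes the model toward high-reward samples.
    """
    weights = reward_weights(rewards, alpha)
    return -sum(w * ll for w, ll in zip(weights, log_likelihoods))


if __name__ == "__main__":
    # Toy batch: three samples with made-up log-likelihoods and rewards.
    lls = [-2.0, -1.5, -3.0]
    rs = [0.1, 0.9, 0.4]
    print(reward_weights(rs, alpha=0.5))
    print(reward_weighted_nll(lls, rs, alpha=0.5))
```

With equal rewards the weights are uniform and the objective reduces to the ordinary average negative log-likelihood, which is a quick sanity check on the weighting.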

Computation and Language
Weak-to-Strong Reasoning
Yuqing Yang, Yan Ma, Pengfei Liu

This paper introduces a progressive learning framework that enables a strong model to autonomously refine its training data, without requiring input from either a more advanced model or human-annotated data. The framework begins with supervised fine-tuning on a small, selectively chosen, high-quality dataset, followed by preference optimization on contrastive samples identified by the strong model itself. The paper demonstrates that the proposed method significantly enhances the reasoning capabilities of Llama2-70B using three separate weak models.
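
As a rough illustration of what "contrastive samples identified by the model itself" could look like in practice, the sketch below builds one chosen/rejected pair from several sampled answers to the same question. The selection rule used here (majority vote over final answers) is an assumption for illustration only, and the paper's actual criterion may differ; `build_preference_pair` and the sample format are hypothetical.

```python
from collections import Counter


def build_preference_pair(samples):
    """Build a (chosen, rejected) pair from sampled solutions.

    `samples` is a list of (reasoning_text, final_answer) tuples for one
    question. Assumption: the most frequent final answer is treated as
    "chosen" and the least frequent as "rejected". Returns None when the
    samples offer no clear contrast.
    """
    counts = Counter(answer for _, answer in samples)
    if len(counts) < 2:
        return None  # every sample agrees, so there is nothing to contrast
    ranked = counts.most_common()  # sorted by count, highest first
    top_answer, top_count = ranked[0]
    low_answer, low_count = ranked[-1]
    if top_count == low_count:
        return None  # tie: no answer is clearly preferred
    chosen = next(text for text, ans in samples if ans == top_answer)
    rejected = next(text for text, ans in samples if ans == low_answer)
    return {"chosen": chosen, "rejected": rejected}


if __name__ == "__main__":
    sampled = [
        ("2 + 2 = 4", "4"),
        ("two plus two is four", "4"),
        ("2 + 2 = 5", "5"),
    ]
    print(build_preference_pair(sampled))
```

Pairs in this shape are the usual input to preference optimization methods, which train the model to prefer the chosen response over the rejected one.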

Software Engineering
CoDefeater: Using LLMs To Find Defeaters in Assurance Cases
Usman Gohar, Michael C. Hunter, Robyn R. Lutz, Myra B. Cohen

Constructing assurance cases is a widely used, and sometimes required, process toward demonstrating that safety-critical systems will operate safely in their planned environment. To mitigate the risk of errors and missing edge cases, the concept of defeaters (arguments or evidence that challenge claims in an assurance case) has been introduced. Defeaters can provide timely detection of weaknesses in the arguments, prompting further investigation and timely mitigations. However, capturing defeaters relies on expert judgment, experience, and creativity and must be done iteratively due to evolving requirements and regulations. This paper proposes CoDefeater, an automated process to leverage large language models (LLMs) for finding defeaters. Initial results on two systems show that LLMs can efficiently find known and unforeseen feasible defeaters to support safety analysts in enhancing the completeness and confidence of assurance cases.


Thank you for joining us this week. Stay tuned for more insights in our next edition. Until then, happy researching! See you next week!
