#5 Arxiv Weekly Insights

Welcome to the 5th edition of "Arxiv Weekly Insights," where we delve into the latest groundbreaking research and developments from the Arxiv repository.

This newsletter is brought to you by SmartXiv, the AI-powered personalized arXiv digest designed to enhance your research experience. With over 1000 research papers uploaded daily on arXiv, it's easy to miss important updates. Let SmartXiv deliver personalized recommendations so you never miss what truly matters to you.
Get started today and save 30% with your annual subscription.

Logic in Computer Science
Model Counting in the Wild
Arijit Shaw, Kuldeep S. Meel

This paper conducts a rigorous assessment of the scalability of model counters in the wild. The authors evaluate six state-of-the-art model counters on 2262 benchmarks from 11 application domains. The empirical evaluation demonstrates that the performance of model counters varies significantly across different application domains, underscoring the need for careful selection by the end user.

Computers and Society
The News Comment Gap and Algorithmic Agenda Setting in Online Forums
Flora Böwing, Patrick Gildersleve

This paper analyzes 1.2 million comments from Austrian newspaper Der Standard to understand the 'News Comment Gap' and the effects of different ranking algorithms. The authors find that journalists prefer positive, timely, complex, direct responses, while readers favour comments similar to article content from elite authors.

Machine Learning
Transformer Explainer: Interactive Learning of Text-Generative Models
Aeree Cho, Grace C. Kim, Alexander Karpekov, Alec Helbling, Zijie J. Wang, Seongmin Lee, Benjamin Hoover, Duen Horng Chau

Transformer Explainer is an interactive visualization tool designed for non-experts to learn about Transformers through the GPT-2 model. It runs a live GPT-2 instance locally in the user's browser and requires no installation or special hardware.

Computers and Society
Criticizing Ethics According to Artificial Intelligence
Irina Spiegel

This article presents a critique of ethics in the context of artificial intelligence, arguing for the need to question established patterns of thought and traditional authorities, including core concepts such as autonomy, morality, and ethics.

Computer Vision and Pattern Recognition
Improving Network Interpretability via Explanation Consistency Evaluation
Hefeng Wu, Hao Jiang, Keze Wang, Ziyi Tang, Xianghuan He, Liang Lin

While deep neural networks have achieved remarkable performance, they tend to lack transparency in prediction. The pursuit of greater interpretability in neural networks often results in a degradation of their original performance. Some works strive to improve both interpretability and performance, but they primarily depend on meticulously imposed conditions. In this paper, we propose a simple yet effective framework that acquires more explainable activation heatmaps and simultaneously increase the model performance, without the need for any extra supervision. Specifically, our concise framework introduces a new metric, i.e., explanation consistency, to reweight the training samples adaptively in model learning. The explanation consistency metric is utilized to measure the similarity between the model's visual explanations of the original samples and those of semantic-preserved adversarial samples, whose background regions are perturbed by using image adversarial attack techniques. Our framework then promotes the model learning by paying closer attention to those training samples with a high difference in explanations (i.e., low explanation consistency), for which the current model cannot provide robust interpretations. Comprehensive experimental results on various benchmarks demonstrate the superiority of our framework in multiple aspects, including higher recognition accuracy, greater data debiasing capability, stronger network robustness, and more precise localization ability on both regular networks and interpretable networks. We also provide extensive ablation studies and qualitative analyses to unveil the detailed contribution of each component.

Computer Vision and Pattern Recognition
How Well Can Vision Language Models See Image Details?
Chenhui Gou, Abdulwahab Felemban, Faizan Farooq Khan, Deyao Zhu, Jianfei Cai, Hamid Rezatofighi, Mohamed Elhoseiny

This paper explores the ability of Large Language Model-based Vision-Language Models (LLM-based VLMs) to perceive image details beyond the semantic level. The authors introduce a pixel value prediction task (PVP) and find that existing VLMs struggle to predict precise pixel values by only fine-tuning the connection module and LLM. However, prediction precision is significantly improved when the vision encoder is also adapted. The research reveals that incorporating pixel value prediction as one of the VLM pre-training tasks and vision encoder adaptation markedly boosts VLM performance on downstream image-language understanding tasks requiring detailed image perception.


Thank you for joining us this week. Stay tuned for more insights in our next edition. Until then, happy researching! See you next week!