A vulnerability in the ArxivReader class of the run-llama/llama_index repository, versions up to v0.12.22.post1, allows for MD5 hash collisions when generating filenames for downloaded papers. This can lead to data loss as papers with identical titles but different contents may overwrite each other, preventing some papers from being processed for AI model training. The issue is resolved in version 0.12.28.
Metrics
Affected Vendors & Products
References
History
Tue, 08 Jul 2025 00:30:00 +0000
Type | Values Removed | Values Added |
---|---|---|
References |
| |
Metrics |
threat_severity
|
threat_severity
|
Mon, 07 Jul 2025 16:15:00 +0000
Type | Values Removed | Values Added |
---|---|---|
Metrics |
ssvc
|
Mon, 07 Jul 2025 10:00:00 +0000
Type | Values Removed | Values Added |
---|---|---|
Description | A vulnerability in the ArxivReader class of the run-llama/llama_index repository, versions up to v0.12.22.post1, allows for MD5 hash collisions when generating filenames for downloaded papers. This can lead to data loss as papers with identical titles but different contents may overwrite each other, preventing some papers from being processed for AI model training. The issue is resolved in version 0.12.28. | |
Title | MD5 Hash Collision in run-llama/llama_index | |
Weaknesses | CWE-440 | |
References |
| |
Metrics |
cvssV3_0
|

Status: PUBLISHED
Assigner: @huntr_ai
Published: 2025-07-07T09:54:22.506Z
Updated: 2025-07-07T15:23:18.518Z
Reserved: 2025-03-31T12:26:26.971Z
Link: CVE-2025-3044

Updated: 2025-07-07T15:23:08.912Z

Status : Received
Published: 2025-07-07T10:15:26.717
Modified: 2025-07-07T16:15:23.013
Link: CVE-2025-3044
