VPSPulse Mirrors

High-Performance Open-Source Archive

NEWS

privacyR NEWS

Critical fix: Replaced weak hash function in UUID generation with cryptographic hash (MD5) to prevent duplicate UUIDs for different patients in large datasets
Added robust hash function for datasets with more than 5 unique values
Maintains backward compatibility: small datasets (≤5 unique values) use original hash method
Now handles datasets with 1 million+ records without collisions
Added digest package as required dependency for robust hashing
Maintains referential integrity (same input → same UUID) while ensuring uniqueness

For large datasets (>5 unique values): Uses MD5 hash via digest package
For small datasets (≤5 unique values): Uses original hash method (backward compatible)
MD5 collision probability for 1M records: ~10^-15 (negligible)
All functions using UUIDs (anonymize_id, anonymize_names, anonymize_locations) benefit from this fix

Need mirroring services?
Contact our team at info@vpspulse.com.

Mirror powered by VPSpulse

Infrastructure sponsored by VPSPulse & Secure Payments by ArionPay.