Hash function in minHash.c may not be the best choice #1

Description

@elmtree8

I implemented the first function from this page, which has historically worked well. It didn't cause any collisions on my small sample, and it produced hashes that gave good results for min_hash_sim.py. However, reading this page makes me wonder whether we could find a better one in the future.
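One way to compare candidates later would be to count collisions on a larger sample. The sketch below is only an illustration: `djb2` is a stand-in (this issue does not name the function actually used in minHash.c), and `count_collisions` and the `shingle-N` sample data are hypothetical names made up for the example.

```python
from collections import defaultdict

MASK32 = 0xFFFFFFFF  # emulate a 32-bit unsigned hash value, as a C implementation might use


def djb2(data: bytes) -> int:
    """djb2 string hash. Stand-in only: not confirmed to be the function in minHash.c."""
    h = 5381
    for byte in data:
        h = (h * 33 + byte) & MASK32
    return h


def count_collisions(hash_fn, items):
    """Count distinct items whose hash is already taken by another distinct item."""
    buckets = defaultdict(set)
    for item in items:
        buckets[hash_fn(item)].add(item)
    # A bucket holding k distinct items contributes k - 1 collisions.
    return sum(len(group) - 1 for group in buckets.values())


if __name__ == "__main__":
    # Hypothetical sample; real shingles from the project's input would be more telling.
    sample = [f"shingle-{i}".encode() for i in range(10000)]
    print("collisions:", count_collisions(djb2, sample))
```

Swapping another function in for `djb2` and running it on the same sample would give a rough side-by-side comparison, though a low collision count alone does not guarantee good results for min_hash_sim.py.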
