houndsniff

Hash identification tool
git clone git://mcdim.xyz/houndsniff.git
Log | Files | Refs | README | LICENSE

README.md (3912B)


      1 # houndsniff
      2 Houndsniff is a minimal hash identification tool written in C that uses a database to compare a hash's features to characteristics of other common hashing algorithm results. It also prints out the percentages of likelihood for each hashing alogirthm to be the one that produced the hash based on their popularity. Email hashes that I missed at <mk@mcdim.xyz>
      3 
      4 Website: https://mcdim.xyz/projects/hs.html
      5 
      6 By Michael Constantine Dimopoulos, et al
      7 
      8 with significant contributions by [Christopher Wellons](https://github.com/skeeto) and revisions & suggestions by [Martin K.](https://github.com/kurahaupo) as well as tuu and fizzie on ##c on Freenode.
      9 
     10 Current version: 2.1
     11 
     12 ![Screenshot](https://blogger.googleusercontent.com/img/a/AVvXsEh7UuJAMRdL4MOA82DH7C2g78X1t_kvsQuKcG-Cww6SqbZ68f_G8TZ3ibOJi7s09bMopa34NkzCbWILgWo0budEUbSlsTtK6GUDogOkZvtZQ0lqhiz2GBBkQkzR7PRDosDHm5RMbcQ6gHOQH3DxL-hJy2ncIlyxIb2gU1fcs5aLaIAM83ezm2NtnrPa=s16000)
     13 
     14 Clone, build & install
     15 ----
     16 Dependencies:
     17 + readline - `sudo apt install libreadline-dev` for Debian & `sudo pacman -S readline` for Arch
     18 ```
     19 $ git clone https://github.com/michaeldim02/houndsniff.git && cd houndsniff/src
     20 $ sudo make install
     21 ```
     22 
     23 Usage
     24 ----
     25 ```
     26 houndsniff [HASH]
     27 ```
     28 or, alternatively you can use the interactive shell:
     29 ```
     30 houndsniff -i
     31 ```
     32 You can exit with ^C.
     33 
     34 You can also use scripting mode, which allows houndsniff to get hashes from stdin and print to stdout without much eye-candy:
     35 
     36 ```
     37 echo "$hash" | houndsnif -s
     38 ```
     39 
     40 Thanks
     41 ----
     42 Thanks a lot to tuu and fizzie, as well as kurahaupo on ##c @ freenode for their revisions and suggestions! (even though I've yet to implement everything). I truly appreciate the help.
     43 
     44 ## Donate
     45 [![paypal](https://mcdim.xyz/media/button_paypal2.png)](https://www.paypal.com/donate/?hosted_button_id=HZXXLD586P5CW)
     46 
     47 More donation options on the website: <a href="https://mcdim.xyz/"/>mcdim.xyz</a>
     48 
     49 Note
     50 ----
     51 It is impossible to mathematically and algorithmically determine which specific algorithm produced a certain hash, and that's definitely not houndsniff is claiming to be able to do. A hash of a specific length and characterset could have been generated by a number of different hashing algorithms, and the hash itself doesn't give further information for one to determine whether or not it was produced by i.e MD5 or RIPEMD128, since both of these algorithms produce 128 bit hashes.
     52 
     53 The only effective way to distinguish between two hashing algorithms that produce hashes of the same characteristics is through the popularity of each hashing algorithm. This is how the percentages are calculated.
     54 
     55 **The percentages are not arbitrary or random**. They are hardcoded in the program in `select.c`, which may give the impression that they are random. This is not the case. They are determined through web search engine results. The numbers are combined into a sum which is then split accordingly into percentages. This process is done semi-automatically with the use a script.
     56 
     57 So, generally, regarding the percentages:
     58 + houndsniff is not meant to give the impression that the percentages are calculated in real time. The percentages are hardcoded into the program.
     59 + Still, the percentages aren't random or arbitrary. I chose to hardcode them in because it would make the tool a lot lighter, easier to maintain, so that it wouldn't require an internet connection to run, and so that it wouldn't randomly bruteforce websites. Hash algorithm popularity does not change overnight, and if it did we would just update houndsniff, so there's no reason why the percentages should be calculated in real time, and not just hardcoded in.
     60 + the percentages are truncated to 2 decimal digits, so they may not add up to 100 exactly.
     61 + only known hashing algorithms are taken into consideration when calculated them.
     62 + and lastly, the percentages are, of course, only as accurate as the web search engine results.