Spreadsheet comparison | Sololearn: Learn to code for FREE!
+ 1

Spreadsheet comparison

Any thoughts on how to use Python to compare two Excel spreadsheets for similarities? I work with a lot of data that I upload from different locations.

28th Jul 2019, 4:29 PM
Dave Merrique
1 Answer
+ 1
You might benefit from using pandas, NumPy, and maybe NLTK, but the data sets will need to have matching data types: integers compared to integers, strings to strings, and so on. pandas can read the Excel data and NumPy can arrange the datasets, so you can probably do the comparison at this level. The NLTK library will help with tokenizing strings, defining stop words, and comparing strings. Once you have defined these methods, you can pass in the two Excel spreadsheets to compare. I recently created a string comparator on SoloLearn, linked here. It might also help, and it has fewer moving parts, especially if you are only comparing strings: https://code.sololearn.com/cBMcdSFzU654/?ref=app
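To make the pandas idea concrete, here is a minimal sketch, not the linked comparator. It assumes both sheets share the same column names and types. The sample data, the column names `id` and `city`, and the helpers `compare_rows` and `string_similarity` are all made up for illustration, and the standard library's `difflib` stands in for NLTK to keep the example small. With real files you would load each sheet with `pd.read_excel` instead of building the DataFrames by hand.

```python
import difflib

import pandas as pd

# Hypothetical sample data standing in for two spreadsheets. With real files,
# each DataFrame would come from pd.read_excel("some_file.xlsx") instead.
sheet_a = pd.DataFrame({"id": [1, 2, 3], "city": ["Boston", "Denver", "Austin"]})
sheet_b = pd.DataFrame({"id": [2, 3, 4], "city": ["Denver", "Austin, TX", "Miami"]})


def compare_rows(left, right):
    """Outer-merge on all shared columns and label where each row came from.

    The added "_merge" column is "both", "left_only", or "right_only".
    """
    return left.merge(right, how="outer", indicator=True)


def string_similarity(a, b):
    """Return a similarity ratio between 0 and 1 for two strings, ignoring case."""
    return difflib.SequenceMatcher(None, a.lower(), b.lower()).ratio()


result = compare_rows(sheet_a, sheet_b)

# Rows that appear identically in both sheets.
common = result[result["_merge"] == "both"]
print(common[["id", "city"]])

# Rows that exist in only one sheet.
print(result[result["_merge"] != "both"])

# Fuzzy check for values that are close but not identical.
print(string_similarity("Austin", "Austin, TX"))
```

The exact-match merge only catches rows that are identical in every shared column, which is why the fuzzy string check is useful alongside it for values entered slightly differently at each location.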
28th Jul 2019, 8:58 PM
Steven M