click here if you want to see your banner on this site

Author Topic: What are the best methods to compare similarity in texts?  (Read 214 times)

Zurro0

  • Full Member
  • ***
  • Posts: 143
  • Country: us
  • Karma: +0/-0
  • Gender: Male
    • View Profile
What are the best methods to compare similarity in texts?
« on: June 09, 2019, 03:00:00 PM »
What are the best methods to compare similarity in texts?

mogulkahn

  • Jr. Member
  • **
  • Posts: 90
  • Karma: +0/-0
    • View Profile
Re: What are the best methods to compare similarity in texts?
« Reply #1 on: June 10, 2019, 03:32:31 PM »
The diversity of the answers given so far clearly illustrate the vagueness of the original question.

For a precise answer you need to specify along which dimension(s) you wish to measure textual similarity. The techniques to recommend fully depend on what you want to measure. Possible textual features to consider are:

Text length
Text formatting
Words and/or n-grams used
Stylistic aspects
Topics covered
… to name but a few.

The suitable similarity measure you are asking for depends on which dimensions you are interested in. Cosine similarity works well for 3 while LSA is good for 5, etc.

 

Bitcoin Garden 2013-2024, All rights reserved | Privacy Policy | DMCA | About Bitcoin Garden | Support & Services