Page 294 - AI Ver 1.0 Class 10
P. 294

healer                -er               heal


                                           studies               -es               studi

                                           studying              -ing              study




              Step 6: Lemmatization
              This is also a process of removing the affixes from the words to create a meaningful base word. The word we get
              after removing the affix is called lemma. Since it always focuses on creating a meaningful lemma, the processing
              time is longer and better from stemming.

                                             Word              Affixes            Lemma

                                            healed               -ed               heal

                                            healing              -ing              heal

                                            healer               -er               heal


                                            studies              -es               study

                                            studying             -ing              study


              Difference between Stemming and Lemmatization


              The difference between stemming and lemmatization can be summarised by this example:



                                          Taking         Stemming                   Tak







                                          Taking         Lemmatization             Take







                       Techniques of Natural Language Processing



              There are many techniques used in NLP for extracting information but the three given below are most commonly
              used:

              1. Bag Of Words

              2. Term Frequency and Inverse Document Frequency (TFIDF)

              3. NLTK

                        292   Touchpad Artificial Intelligence-X
   289   290   291   292   293   294   295   296   297   298   299