Page 64 - ComputerScience_Class_11
P. 64

Unicode Codes for Indian Scripts

                                   Language            Script       Starting hexa code  Ending hexa code
                               Urdu              Arabic                  0600              06FF
                               Hindi             Devanagari              0900              097F
                               Bengali           Bengali                 0980              09FF
                               Punjabi           Gurumukhi               0A00              0A7F
                               Gujarati          Gujarati                0A80              0AFF
                               Oriya             Oriya                   0B00              0B7F
                               Tamil             Tamil                   0B80              0BFF
                               Telugu            Telugu                  0C00              0C7F
                               Kannada           Kannada                 0C80              0CFF
                               Malayalam         Malayalam               0D00              0D7F
              Advantages of Unicode are as follows:
              •  A total of 159 scripts having 144697 characters are supported by Unicode which covers all recognised languages in
                the world.
              •  Unicode supports all Internet standards like HTML, XML, Java, JavaScript and Perl.
              •  The Unicode standards ensure interoperability and portability by prescribing conformant behaviour.
              Disadvantage of Unicode is given below:

              •  UTF-16 and UTF-32 require more memory space.

                             Let’s Revisit

                    ♦ Data is a collection of raw facts and figures which when processed gives meaningful information.
                    ♦ A byte is a unit that most computers use to represent any character.
                    ♦ A character set consists of a group of characters that are used to represent a particular language.
                                                                                            4
                    ♦ BCD is one of the oldest coding systems, where each decimal digit is expressed as a group of   bits or a nibble.
                    ♦ ASCII is the most popular coding scheme used as industry standard in computers and on the web.
                    ♦ ISCII is a character encoding standard designed to represent the character set of different Indian languages.
                    ♦ Unicode is an international character encoding standard that includes different languages, scripts and symbols.



                                                            MIND DRILL

                   Solved Questions



              A.  Tick ( ) the correct option.
                       1.  1 Gigabyte is equivalent to ………………… .
                     a.  1024 kilobytes                             b.  1024 megabytes
                     c.  1024 terabytes                             d.  1024 petabytes
                  2.  Which of the following is not an encoding scheme?
                     a.  ASCII                                      b.  ISCII
                     c.  BCD                                        d.  ABC
                  3.  Unicode can encode a character set of ………………… .
                     a.  Urdu                                       b.  Kannada
                     c.  Bengali                                    d.  All of these
                  4.  UTF-8, UTF-16 and UTF-32 are encodings of ………………… .
                     a.  ASCII                                      b.  Unicode
                     c.  ISCII                                      d.  BCD



                   62  Touchpad Computer Science (Ver. 3.0)-XI
   59   60   61   62   63   64   65   66   67   68   69