1.2 Text

Cards (7)

  • Character set
    A collection of characters and the corresponding binary codes that represent them

    • All the characters and symbols that can be represented by a computer system; [1 mark]
    • Each character and symbol is assigned a unique value; [1 mark] 
  • ASCII
    Assigns a unique 7-bit binary code to each character, including uppercase and lowercase letters, digits, punctuation marks, and control characters
  • ASCII has limitations in terms of the number of characters it can represent, and it does not support characters from languages other than English
  • Unicode
    A character encoding standard that allows for a greater range of characters and symbols than ASCII, including different languages and emojis
  • Unicode encoding scheme

    Variable-length, assigns a unique code to each character, which can be represented in binary form using multiple bytes
  • As Unicode requires more bits per character than ASCII, it can result in larger file sizes and slower processing times when working with text-based data
  • why does the unicode character set take up more storage space than ASCII
     Each character is encoded using more bits; [1 mark]