Go Strings: Difference between revisions

From NovaOrdis Knowledge Base
Jump to navigation Jump to search
Line 9: Line 9:
=Overview=
=Overview=


The main "use case" for strings is to hold characters made for printing, things you can see, and read. In Go, strings are arrays of bytes that represent characters, encoded using the character encoding standard [[Character_Encoding#Unicode|Unicode]], and, by default, the [[Character_Encoding#UTF-8|UTF-8]] [[Character_Encoding#Character_Encoding_Scheme|character encoding scheme]]. Go refers to Unicode [[Character_Encoding#Code_Point|code points]] as <span id='Rune'></span>'''runes''', instance of the <code>[[Go_Integers#Rune|rune]]</code> type.
The main "use case" for strings is to hold characters made for printing, things you can see, and read. In Go, strings are read-only slices of bytes that represent characters. The language, and the standard library treat strings as containers of [[Character_Encoding#Unicode|Unicode]] characters, encoded in the [[Character_Encoding#UTF-8|UTF-8]] [[Character_Encoding#Character_Encoding_Scheme|character encoding scheme]]. While other languages refer to the string's components as "characters", Go refers to the same components as <span id='Rune'></span>'''runes''', instance of the <code>[[Go_Integers#Rune|rune]]</code> type. They are <font color=darkkhaki>32-bit integers</font> that represent Unicode [[Character_Encoding#Code_Point|code points]].


Strings are '''immutable'''.
Strings are '''immutable'''.

Revision as of 00:48, 22 August 2023

External

Internal

Overview

The main "use case" for strings is to hold characters made for printing, things you can see, and read. In Go, strings are read-only slices of bytes that represent characters. The language, and the standard library treat strings as containers of Unicode characters, encoded in the UTF-8 character encoding scheme. While other languages refer to the string's components as "characters", Go refers to the same components as runes, instance of the rune type. They are 32-bit integers that represent Unicode code points.

Strings are immutable.

A string variable that is not explicitly initialized is implicitly initialized with the empty string.

String Variable Declaration

The pre-declared type identifier for strings is string.

var s string                // string type declaration without initialization
s = "example 1"             // initialization after declaration
var s2 string = "example 2" // variable initialization in declaration
var s3 = "example 3"        // variable initialization with type inference
s4 := "example 4"           // short variable declaration

String Literals

A string literal is a string constant produced by concatenating characters. Go has two kind of string literals: interpreted string literals and raw string literals.

Interpreted String Literals

An interpreted string literal is represented in Go code as a sequence of characters enclosed in double quotes. Each character is a byte, a rune, an UTF-8 code point. Interpreted strings allow escaping (\n or \t).

s := "something\nsomething else"
println(s)

Raw String Literals

Raw string literals are sequences of characters enclosed in backquotes (backticks) `. All characters between the pair of matching backticks is taken literally, back slashes have no special meaning and new lines can appear. Carriage return characters inside raw string literals are discarded.

s := `This
is an \n \t
example of 
raw string literal`

println(s)

will produce:

This
is an \n \t
example of 
raw string literal

Empty String

emptyString1 := ""
emptyString2 := ``

Operators

Indexing Operator []

The indexing operator [] returns a byte (uint8). It does NOT return a rune.

Strings are zero-based indexed. If the index is out of bounds, the runtime generates a run-time panic:

panic: runtime error: index out of range [6] with length 3

Concatenation Operator +

The concatenation operator + joins two strings together, producing a new immutable string instance. An attempt to use the concatenation operation between a string and an int, for example, won't work, because the int won't be automatically converted to string the way Java does.

s := "abc"
s2 := "xyz"
println(s + s2)

Equality Operator ==

String equality is tested with the == operator:

s := "blue"
s2 := "blue"
if s == s2 {
  println("strings are equal")
}

Reading Strings

String Length

The length of a string is obtained by invoking the built-in function len() on the string.

s := "blue"
l := len(s)

Introspect Runes

The unicode package provides a set of function to introspect the string characters (runes).

String Manipulation and Processing in Go

Rune Conversions

TO DISTRIBUTE

Conversion of a byte to string

TO PROCESS:

Conversion between bytes and strings

Reading with a string with a Reader

TO PROCESS:

strings.NewReader()

See Go_Package_strings#NewReader.28.29