Go Structs: Difference between revisions

From NovaOrdis Knowledge Base
Jump to navigation Jump to search
 
(225 intermediate revisions by the same user not shown)
Line 1: Line 1:
=External=
* https://go.dev/ref/spec#Struct_types
=Internal=
=Internal=
* [[Go_Language#Structs|Go Language]]


* [[Go_Concepts_-_The_Type_System#Structs|The Type System]]
=Overview=
Structs are user-defined [[Go_Language#Composite_Types|composite types]], grouping together instances of arbitrary types, into one object.
 
A struct declaration consists in a sequence of '''named elements''', called '''fields'''. Each field has one or more names and a type. The name(s) can be explicitly listed in the structure, or can be implicit, for '''embedded fields'''. A field may optionally have a '''tag'''. More details about fields can be found in the [[#Fields|Fields]] section.
 
A struct variable is a value, not a [[Go_Language#.28No.29_Reference_Variables|reference variable]], which means that no two different struct variables may point to the same struct instance.
 
Structs can represent data values that could have either a [[Go_Language#The_Primitive_vs._Non-Primitive_Nature_of_Types|primitive or non-primitive value]]. "Primitive" structs imply creating a new instance every time an existing value is mutated. In this case, values themselves, rather than pointers, are used to share values of those structs. An example is the <code>Time</code> structure in the <code>[[Go Package time|time]]</code> package. However, in most cases, structs exhibit a non-primitive structure: adding or removing something from the value mutates the value, does not create a new value.
 
=Printing Structs=
To print the abbreviated struct, and only display values, and not field names, use:
<syntaxhighlight lang='go'>
fmt.Println(s)
</syntaxhighlight>
 
To print both field names and values, use the  <code>%+v</code> format specifier:
<syntaxhighlight lang='go'>
fmt.Printf("%+v\n", s)
</syntaxhighlight>
 
=Declaration=
Struct types can be declared at package level or inside a function.


=Overview=
The struct type definition is introduced by the <code>type</code> [[Go_Language#Keywords|keyword]], to indicate that this is a user-defined type, followed by the new type name and the keyword <code>struct</code>, followed by a curly-brackets-enclosed enumeration of fields.


A <tt>struct</tt> is a user-defined type that contains named fields (and possibly other type names, see [[#Embedded_Types|embedded types]] below). The fields usually represent a ''has-a'' relationship. <tt>struct</tt>s can also have [[Go_Concepts_-_Functions#Methods|methods]] associated with them.
Once the struct type has been defined in a package or a function, variables of that type can be declared using the [[Go_Language#Variable_Declaration|long declaration]]:
<syntaxhighlight lang='go'>
var i Item
</syntaxhighlight>
If no explicit initialization follows, the struct variables declared as above have all their fields initialized with [[Go_Language#Zero_Value|zero values]. [[Go_Language#Short_Variable_Declaration|Short declaration]] can also be used, as shown in the [[#Initialization|Initialization]] section, below.
==<span id='Field'></span>Fields==


<font color=red>
Each field has '''one or more names''' and a '''type'''. The type can be a [[Go_Language#Pre-Declared_Types|pre-declared type]] or a user-defined type, such as another <code>struct</code> or an [[Go_Interfaces#Interfaces_as_Fields|interface]]. No comma is required at the end of the field line. The name(s) can be explicitly listed in the structure, or can be implicit, in case of [[#Embedded_Fields|embedded fields]]. A field may optionally have a [[#Tag|tag]]. Within a struct, non-blank field names must be unique.
* Are all users can define (in terms of types) structs, or there are other user-defined types?
</font>


=Definition=
<span id='Has-A_Relationship'></span>A named field usually represents a '''has-a''' relationship, consistent with the struct's composite type nature, and an embedded field represents an '''is-a''' relationship (see [[#Embedded_Fields|below]]).


The <tt>struct</tt>s can be declare at package level or inside a function. The struct type definition is introduced by the <tt>type</tt> keyword, to indicated that this is a user-defined type, followed by the type name and the keyword <tt>struct</tt>. Each field has a name and a type.
Fields can be [[Go_Packages#Export|exported]] outside the package declaring the type. For more details see the [[#Exporting_Fields|Exporting Fields]] section.


<pre>
These are named fields:
type myStruct struct {
<syntaxhighlight lang='go'>
     i int
type Item struct {
     s string
     color string
    size int
}
</syntaxhighlight>
A field with multiple names can be declared as such:
<syntaxhighlight lang='go'>
type Item struct {
     a, b, c int
}
}
</pre>
</syntaxhighlight>
We say that fields with the same types can be collapsed. This structure is fully equivalent with the structure where each of the names is declared on its own line:
<syntaxhighlight lang='go'>
type Item struct {
    a int
    b int
    c int
}
</syntaxhighlight>
===<span id='Embedded_Type'></span><span id='Embedded_Field'></span><span id='Embedded_Field_Promotion'></span><span id='Promoted_Fields'></span><span id='Embedded_Field_Identity'></span><span id='Overriding_Embedded_Fields'></span><span id=''></span><span id=''></span><span id=''></span><span id=''></span><span id=''></span>Embedded Fields===


Fields with the same types can be collapsed:
Embedded fields, embedded field promotion, promoted fields, embedded field identity, overriding embedded fields are discussed in the following article: {{Internal|Go_Structs_Embedded_Fields#Overview|Embedded Fields}}


<pre>
===<span id='Blank_Field'></span>Blank Fields===
type myStruct struct {
<syntaxhighlight lang='go'>
     ...
type Item struct {
    i, j, k int
_ float32 // a blank field
  ...
     _ int // another blank field
}
</syntaxhighlight>
===<span id='Tag'></span>Tags===
A field declaration may be followed by an optional string literal '''tag''', which is a string literal declared between <code>`...`</code> (backticks). A tag becomes an '''attribute''' for all the fields in the corresponding field declaration. The tags are made visible through a reflection interface and take part in type identity for structs but are otherwise ignored.
<syntaxhighlight lang='go'>
type A struct {
    Name string `json:"name"`
}
}
</pre>
</syntaxhighlight>
In the example above, a tag has been included to provide '''metadata''' the JSON decoding function needs to parse JSON content. Each tag maps a field name in the struct to a filed name in the JSON document. Tags are useful in [[JSON_in_Go#Overview|JSON]] and [[YAML_in_Go#Overview|YAML]] serialization/deserialization.


=Initialization=
=<tt>struct</tt> Zero Value=
 
<font color=darkkhaki>Expand this, <code>struct</code> zero value is important when [[YAML_in_Go#Behavior_when_an_Entire_Subtree_Is_Missing|parsing YAML and entire subtrees are missing]].</font>
=Empty <tt>struct</tt>=
An empty struct allocates zero bytes when values of this type are created. They are useful when a type, but not state, is needed. Example:
<syntaxhighlight lang='go'>
// we declared an empty struct that defines the type "A"
type A struct{}


==Long Variable Declaration==
a := A{}
b := struct{}{}
</syntaxhighlight>


<pre>
=<span id='Idiomatic_Struct_Naming_Conventions'></span>Naming=
var ms myStruct
Struct names should follow the general Go naming conventions:
</pre>
{{Internal|Go_Style#Naming|Go Naming}}


If no explicit initialization follows, all the struct's fields are initialized with their [[Go Concepts - The Type System#Zero_Value|zero value]].
=<span id='Literal'></span>Composite Literal=
Struct composite literals are expressions that create a new instance of the structure every time they are evaluated. The fields of a composite literal must be laid out in order and must be all present:
<syntaxhighlight lang='go'>
i := Item{"blue", 5}
</syntaxhighlight>


==Short Variable Declaration==
However, by labeling the elements explicitly as <code>field:value</code> pairs, the initializers can appear in any order, with the missing ones left as their respective zero values:
<syntaxhighlight lang='go'>
i := Item{size: 5}
</syntaxhighlight>


===Literal===
Composite literals can have all field initialization values on the same line, or on different lines:


Struct literal initialization:
<syntaxhighlight lang='go'>
i := Item{color: "blue", size: 5}
</syntaxhighlight>


<pre>
<syntaxhighlight lang='go'>
ms := myStruct{i: 5, s: "something"}
i := Item {
</pre>
  color: "blue",
  size: 5, // mandatory comma
}
</syntaxhighlight>


There is a shorter struct literal where the name of the fields are omitted, provided that the order is maintained:
There is "short" literal where the name of the fields are omitted, given that values for '''all''' fields are provided, the order in which they are declared is maintained:


<pre>
<syntaxhighlight lang='go'>
ms := myStruct{5, "something"}
i := Item {"blue", 5}
</pre>
</syntaxhighlight>


===<tt>new()</tt>===
It is possible to initialize just some of the fields, but in this case the field names must be provided. The remaining fields will be initialized to their zero value:
<syntaxhighlight lang='go'>
i := Item {color:"blue"}
</syntaxhighlight>


<tt>[[Go_Built-In_Function_new|new()]]</tt> initialization:
==Embedded Literals==
When a field of a struct is another struct, embedded literals can be used in initialization:
<syntaxhighlight lang='go'>
i := Item {
  color: "blue",
  packaging: {
    material: "plastic",
    protectionLevel: 10,
  }
  size: 5,
}
</syntaxhighlight>


<pre>
=Initialization=
msPtr := new(myStruct)
<span id='new'></span>Initialize a struct variable with an empty struct using the <code>[[Go_Functions#new()|new()]]</code> [[Go_Functions#Built-in_Functions|built-in function]] and the [[Go_Language#Short_Variable_Declaration|short variable declaration]]:
</pre>
<syntaxhighlight lang='go'>
i := new(Item)
</syntaxhighlight>


Note <tt>new()</tt> returns a pointer to the structure and not the structure itself.
This results in an "empty" structure, with all fields initialized to zero. Note <code>new()</code> returns a pointer to the structure and not the structure itself.


=Fields=
A [[#Literal|struct composite literal]] can also be used to initialize the structure:
<syntaxhighlight lang='go'>
i := Item{color: "blue", size: 5}
</syntaxhighlight>


Fields can be access with the <tt>[[Go_Concepts_-_Operators#.|.]]</tt> operator.
==Make Zero Value Useful==


Note that the <tt>.</tt> operator works with a regular <tt>struct</tt> variable as well as with a pointer to a <tt>struct</tt>: the compiler knows how to compensate for both of these cases.
It is good practice to design the type to be usable with its zero values, right away, without additional initialization, if possible. This means a user of the data structure can create one with <code>new()</code> and get it right to work. For example, the documentation for <code>bytes.Buffer</code> states that "the zero value for <code>Buffer</code> is an empty buffer ready to use". Similarly, <code>sync.Mutex</code> does not have an explicit constructor or <code>Init()</code> method. Instead, the zero value for <code>sync.Mutex</code> is defined to be an unlocked mutex. The zero-value-is-useful property works transitively, if a structure is composed by two types that works while initialized with zero value, the composite structure works while initialized with zero value.


<pre>
==Initializing Struct Fields to Something Else than Zero Value==
ms := myStruct{1, "s"}
<font color=darkkhaki>It seems not to be possible. If I have a Set structure that internally contains a map, I have two options:
msPtr := new(myStruct)


// valid:
1. Use a NewSet() constructor that initializes the map.
ms.i


// valid:
2. Do lazy initialization on operations.
msPtr.i
</font>
</pre>


Even if the enclosing struct type is [[Go Concepts - Packages#Exported_Identifiers|exported]] by a package, not all fields are exported automatically, only those whose first character is an upper case letter. This behavior makes possible to have "private" fields in a public structure, when the structure is used outside its package.
=Operators=
==<span id='The_Dot_Operator'></span>The Selector Operator==
Individual fields in a struct can be read and modified using the "dot notation", the <code>[[Go_Concepts_-_Operators#.|.]]</code> [[Go_Language#Selector_Operator|selector operator]].
<syntaxhighlight lang='go'>
var i Item
i.color = "blue"
i.size = 2


=Embedded Types=
fmt.Printf("color %s, size %d\b", i.color, i.size)
</syntaxhighlight>


An embedded type is initialized by declaring a name of a struct inside other <tt>struct</tt> - without associating it with a field. An embedded type is sometimes referred to as ''anonymous field''. It models an ''is-a'' relationship. Example:
<span id='Selector_Operator_Versatility'></span>Note that the <code>.</code> operator works with a regular struct variable as well as with a pointer to the struct. The compiler takes care of the underlying details to access the value and knows how to compensate for both of these cases:


<pre>
<syntaxhighlight lang='go'>
type A struct {
type SomeStruct struct {
    f1 int
  i int
}
}


type B struct {
s := SomeStruct{10}
    A
s2 := &s
    f2 int
 
}
fmt.Printf("%d\n", s.i)  // the selector operator is applied to a value 
</pre>
fmt.Printf("%d\n", s2.i)  // the selector operator is applied to a pointer
</syntaxhighlight>
 
=Exporting Structs=
<font size='darkkhaki>"Exporting Structs" section needs refactoring.</font>


==Methods and Embedded Types==
Even if the enclosing struct type is [[Go_Language_Modularization#Exported_Identifiers|exported]] by a package, not all fields are exported automatically, only those whose first character is an upper case letter. This behavior makes possible to have "private" fields in a public structure, when the structure is used outside its package.
==Encapsulation and Private Fields==
<font color=darkkhaki>
Formally define the semantics of fields that start with lower case names. They are "hidden". What exactly does that mean? Apparently, other package cannot see them.
</font color>
Aslo see: {{Internal|Go_Language_Object_Oriented_Programming#Encapsulation|Object Oriented Programming in Go &#124; Encapsulation}}
==<span id='Exported_Fields'></span>Exporting Fields==
The fields of an [[Go_Packages#Exporting_Package_Members|exported]] <tt>struct</tt> type can be exported or unexported on a field-by-field basis, by naming them with an uppercase and respectively lowercase letter.


A pointer receiver method associated with the embedded type also works with the embedding type. For example, if there's a:
<font color=darkkhaki>What if the struct is named with a lowercase letter and the field starts with an uppercase letter?</font>


<pre>
<font color=darkkhaki>Explain this behavior: [[YAML_in_Go#TODO_Ee4]].</font>
func (a *A) m1(...) {
    ...
}
</pre>
<tt>m1()</tt> also works directly with B instances as such:


<pre>
==<span id='Embedded_Type_Export'></span><span id='Exporting_Embedded_Types'></span>Exporting Embedded Fields==
bPtr := new(B)
If the name of an [[#Embedded_Field|embedded field]] starts with a lower case, it is unexported even if its outer type is [[Go_Packages#Embedded_Type_Export|exported]]. Even if the name of the inner type is not exported, its fields may be individually exported, and thus accessible from outside the package.
...
bPtr.m1(...)
</pre>


In the code sequence above, the invocations <tt>bPtr.m1(...)</tt> and <tt>bPtr.A.m1(...)</tt> are equivalent.
=Structs as Receiver Types=
{{Internal|Go_Language_Object_Oriented_Programming#Structs_as_Receiver_Types|Object Oriented Programming in Go &#124; Structs as Receiver Types}}

Latest revision as of 02:10, 4 November 2024

External

Internal

Overview

Structs are user-defined composite types, grouping together instances of arbitrary types, into one object.

A struct declaration consists in a sequence of named elements, called fields. Each field has one or more names and a type. The name(s) can be explicitly listed in the structure, or can be implicit, for embedded fields. A field may optionally have a tag. More details about fields can be found in the Fields section.

A struct variable is a value, not a reference variable, which means that no two different struct variables may point to the same struct instance.

Structs can represent data values that could have either a primitive or non-primitive value. "Primitive" structs imply creating a new instance every time an existing value is mutated. In this case, values themselves, rather than pointers, are used to share values of those structs. An example is the Time structure in the time package. However, in most cases, structs exhibit a non-primitive structure: adding or removing something from the value mutates the value, does not create a new value.

Printing Structs

To print the abbreviated struct, and only display values, and not field names, use:

fmt.Println(s)

To print both field names and values, use the %+v format specifier:

fmt.Printf("%+v\n", s)

Declaration

Struct types can be declared at package level or inside a function.

The struct type definition is introduced by the type keyword, to indicate that this is a user-defined type, followed by the new type name and the keyword struct, followed by a curly-brackets-enclosed enumeration of fields.

Once the struct type has been defined in a package or a function, variables of that type can be declared using the long declaration:

var i Item

If no explicit initialization follows, the struct variables declared as above have all their fields initialized with [[Go_Language#Zero_Value|zero values]. Short declaration can also be used, as shown in the Initialization section, below.

Fields

Each field has one or more names and a type. The type can be a pre-declared type or a user-defined type, such as another struct or an interface. No comma is required at the end of the field line. The name(s) can be explicitly listed in the structure, or can be implicit, in case of embedded fields. A field may optionally have a tag. Within a struct, non-blank field names must be unique.

A named field usually represents a has-a relationship, consistent with the struct's composite type nature, and an embedded field represents an is-a relationship (see below).

Fields can be exported outside the package declaring the type. For more details see the Exporting Fields section.

These are named fields:

type Item struct {
    color string
    size int
}

A field with multiple names can be declared as such:

type Item struct {
    a, b, c int
}

We say that fields with the same types can be collapsed. This structure is fully equivalent with the structure where each of the names is declared on its own line:

type Item struct {
    a int
    b int
    c int
}

Embedded Fields

Embedded fields, embedded field promotion, promoted fields, embedded field identity, overriding embedded fields are discussed in the following article:

Embedded Fields

Blank Fields

type Item struct {
	_ float32 // a blank field
    _ int // another blank field
}

Tags

A field declaration may be followed by an optional string literal tag, which is a string literal declared between `...` (backticks). A tag becomes an attribute for all the fields in the corresponding field declaration. The tags are made visible through a reflection interface and take part in type identity for structs but are otherwise ignored.

type A struct {
    Name string `json:"name"`
}

In the example above, a tag has been included to provide metadata the JSON decoding function needs to parse JSON content. Each tag maps a field name in the struct to a filed name in the JSON document. Tags are useful in JSON and YAML serialization/deserialization.

struct Zero Value

Expand this, struct zero value is important when parsing YAML and entire subtrees are missing.

Empty struct

An empty struct allocates zero bytes when values of this type are created. They are useful when a type, but not state, is needed. Example:

// we declared an empty struct that defines the type "A"
type A struct{}

a := A{}
b := struct{}{}

Naming

Struct names should follow the general Go naming conventions:

Go Naming

Composite Literal

Struct composite literals are expressions that create a new instance of the structure every time they are evaluated. The fields of a composite literal must be laid out in order and must be all present:

i := Item{"blue", 5}

However, by labeling the elements explicitly as field:value pairs, the initializers can appear in any order, with the missing ones left as their respective zero values:

i := Item{size: 5}

Composite literals can have all field initialization values on the same line, or on different lines:

i := Item{color: "blue", size: 5}
i := Item {
  color: "blue",
  size: 5, // mandatory comma
}

There is "short" literal where the name of the fields are omitted, given that values for all fields are provided, the order in which they are declared is maintained:

i := Item {"blue", 5}

It is possible to initialize just some of the fields, but in this case the field names must be provided. The remaining fields will be initialized to their zero value:

i := Item {color:"blue"}

Embedded Literals

When a field of a struct is another struct, embedded literals can be used in initialization:

i := Item {
  color: "blue",
  packaging: {
    material: "plastic",
    protectionLevel: 10,
  }
  size: 5,
}

Initialization

Initialize a struct variable with an empty struct using the new() built-in function and the short variable declaration:

i := new(Item)

This results in an "empty" structure, with all fields initialized to zero. Note new() returns a pointer to the structure and not the structure itself.

A struct composite literal can also be used to initialize the structure:

i := Item{color: "blue", size: 5}

Make Zero Value Useful

It is good practice to design the type to be usable with its zero values, right away, without additional initialization, if possible. This means a user of the data structure can create one with new() and get it right to work. For example, the documentation for bytes.Buffer states that "the zero value for Buffer is an empty buffer ready to use". Similarly, sync.Mutex does not have an explicit constructor or Init() method. Instead, the zero value for sync.Mutex is defined to be an unlocked mutex. The zero-value-is-useful property works transitively, if a structure is composed by two types that works while initialized with zero value, the composite structure works while initialized with zero value.

Initializing Struct Fields to Something Else than Zero Value

It seems not to be possible. If I have a Set structure that internally contains a map, I have two options:

1. Use a NewSet() constructor that initializes the map.

2. Do lazy initialization on operations.

Operators

The Selector Operator

Individual fields in a struct can be read and modified using the "dot notation", the . selector operator.

var i Item
i.color = "blue"
i.size = 2

fmt.Printf("color %s, size %d\b", i.color, i.size)

Note that the . operator works with a regular struct variable as well as with a pointer to the struct. The compiler takes care of the underlying details to access the value and knows how to compensate for both of these cases:

type SomeStruct struct {
  i int
}

s := SomeStruct{10}
s2 := &s

fmt.Printf("%d\n", s.i)   // the selector operator is applied to a value  
fmt.Printf("%d\n", s2.i)  // the selector operator is applied to a pointer

Exporting Structs

"Exporting Structs" section needs refactoring.

Even if the enclosing struct type is exported by a package, not all fields are exported automatically, only those whose first character is an upper case letter. This behavior makes possible to have "private" fields in a public structure, when the structure is used outside its package.

Encapsulation and Private Fields

Formally define the semantics of fields that start with lower case names. They are "hidden". What exactly does that mean? Apparently, other package cannot see them.

Aslo see:

Object Oriented Programming in Go | Encapsulation

Exporting Fields

The fields of an exported struct type can be exported or unexported on a field-by-field basis, by naming them with an uppercase and respectively lowercase letter.

What if the struct is named with a lowercase letter and the field starts with an uppercase letter?

Explain this behavior: YAML_in_Go#TODO_Ee4.

Exporting Embedded Fields

If the name of an embedded field starts with a lower case, it is unexported even if its outer type is exported. Even if the name of the inner type is not exported, its fields may be individually exported, and thus accessible from outside the package.

Structs as Receiver Types

Object Oriented Programming in Go | Structs as Receiver Types