Go Structs: Difference between revisions

From NovaOrdis Knowledge Base
Jump to navigation Jump to search
 
(43 intermediate revisions by the same user not shown)
Line 36: Line 36:
==<span id='Field'></span>Fields==
==<span id='Field'></span>Fields==


Each field has '''one or more names''' and a '''type'''. The type can be a [[Go_Language#Pre-Declared_Types|pre-declared type]] or a user-defined type, such as another <code>struct</code> or an [[Go_Interfaces#Interfaces_as_Fields|interface]]. No comma is required at the end of the field line. The name(s) can be explicitly listed in the structure, or can be implicit, in case of '''embedded fields'''. A field may optionally have a '''tag'''. <font color=darkkhaki>The struct may also contain methods that operate on the structure's fields.</font> Within a struct, non-blank field names must be unique.  
Each field has '''one or more names''' and a '''type'''. The type can be a [[Go_Language#Pre-Declared_Types|pre-declared type]] or a user-defined type, such as another <code>struct</code> or an [[Go_Interfaces#Interfaces_as_Fields|interface]]. No comma is required at the end of the field line. The name(s) can be explicitly listed in the structure, or can be implicit, in case of [[#Embedded_Fields|embedded fields]]. A field may optionally have a [[#Tag|tag]]. Within a struct, non-blank field names must be unique.  


The fields usually represent a '''has-a''' relationship, consistent with the struct's composite type nature.  
<span id='Has-A_Relationship'></span>A named field usually represents a '''has-a''' relationship, consistent with the struct's composite type nature, and an embedded field represents an '''is-a''' relationship (see [[#Embedded_Fields|below]]).


Fields can be [[Go_Packages#Export|exported]] outside the package declaring the type. For more details see [[#Exporting_Fields|Exporting Fields]] below.
Fields can be [[Go_Packages#Export|exported]] outside the package declaring the type. For more details see the [[#Exporting_Fields|Exporting Fields]] section.


These are named fields:
These are named fields:
Line 63: Line 63:
}
}
</syntaxhighlight>
</syntaxhighlight>
===<span id='Embedded_Type'></span><span id='Embedded_Field'></span>Embedded Fields===
===<span id='Embedded_Type'></span><span id='Embedded_Field'></span><span id='Embedded_Field_Promotion'></span><span id='Promoted_Fields'></span><span id='Embedded_Field_Identity'></span><span id='Overriding_Embedded_Fields'></span><span id=''></span><span id=''></span><span id=''></span><span id=''></span><span id=''></span>Embedded Fields===
When only the type, but not the name of a filed is declared, the name of the field is implicitly the unqualified name of the type, and the field is called an '''embedded field''' or an anonymous field:
<syntaxhighlight lang='go'>
type SomeInterface interface {
  ...
}
 
type SomeOtherStruct struct {
  s string
}
 
type Item struct {
SomeInterface
    SomeOtherStruct
int
}
</syntaxhighlight>
 
Embedded fields, unlike named fields, model a '''is-a''' relationship.
 
An embedded field can be a type name <code>T</code>, a pointer to a non-interface type name (<code>*T</code>), and <code>T</code> itself may not be a pointer type. As mentioned above, the unqualified type name acts as the field name. The previous example declares two embedded fields, one of type <code>int</code> and the other of type <code>SomeInterface</code>, which is an [[Go_Interfaces#Embedded_Type|interface]] declared somewhere in the package. The type renders as:
<font size=-2>
{SomeInterface:<nil> SomeOtherStruct:{s:} int:0}
</font>
Note that if an embedded field is specified using the package and the type name (the qualified type name), the implicit field name is the unqualified type name.
====Embedded Fields and Methods====
A method associated with the embedded type works with the embedding type. This strengthens the point that embedded fields model an "is-a" relationship, and elevates the field embedding mechanism to a sort of inheritance mechanism.
<syntaxhighlight lang='go'>
type Animal struct {
name string
}
 
func (a *Animal) Move() {
fmt.Printf("%s moves\n", a.name)
}


type Dog struct {
Embedded fields, embedded field promotion, promoted fields, embedded field identity, overriding embedded fields are discussed in the following article: {{Internal|Go_Structs_Embedded_Fields#Overview|Embedded Fields}}
Animal
}
 
...
dog := &Dog{Animal{"Fido"}}
dog.Move() // Displays "Fido moves"
</syntaxhighlight>
In the example above, the invocations <code>dog.Move()</code> and <code>dog.Animal.Move()</code> are equivalent.


===<span id='Blank_Field'></span>Blank Fields===
===<span id='Blank_Field'></span>Blank Fields===
Line 116: Line 74:
}
}
</syntaxhighlight>
</syntaxhighlight>
===Promoted Fields===
<font color=darkkhaki>TODO: https://go.dev/ref/spec#Struct_types</font>
===<span id='Tag'></span>Tags===
===<span id='Tag'></span>Tags===
A field declaration may be followed by an optional string literal '''tag''', which becomes an '''attribute''' for all the fields in the corresponding field declaration. The tags are made visible through a reflection interface and take part in type identity for structs but are otherwise ignored.
A field declaration may be followed by an optional string literal '''tag''', which is a string literal declared between <code>`...`</code> (backticks). A tag becomes an '''attribute''' for all the fields in the corresponding field declaration. The tags are made visible through a reflection interface and take part in type identity for structs but are otherwise ignored.
<syntaxhighlight lang='go'>
<syntaxhighlight lang='go'>
type A struct {
type A struct {
Line 125: Line 81:
}
}
</syntaxhighlight>
</syntaxhighlight>
In the example above, a tag has been included to provide '''metadata''' the JSON decoding function needs to parse JSON content. Each tag maps a field name in the struct to a filed name in the JSON document.
In the example above, a tag has been included to provide '''metadata''' the JSON decoding function needs to parse JSON content. Each tag maps a field name in the struct to a filed name in the JSON document. Tags are useful in [[JSON_in_Go#Overview|JSON]] and [[YAML_in_Go#Overview|YAML]] serialization/deserialization.
 
===TODO===
====Tags====
Struct tags  and declared for struct fields. They provide a specification of how the struct fields should be serialized into JSON <font color=darkkhaki>Is that true? Only JSON or it is more generic than that?</font> and how the serialized information should be deserialized into the struct fields.
<syntaxhighlight lang='go'>
type User struct {
  Name string `example:"name"`
}
</syntaxhighlight>
<syntaxhighlight lang='go'>
type User struct {
  Name string `json:"name"`
}
</syntaxhighlight>
<font color=darkkhaki>To PROCESS: https://www.digitalocean.com/community/tutorials/how-to-use-struct-tags-in-go</font>


=<tt>struct</tt> Zero Value=
=<tt>struct</tt> Zero Value=
Line 155: Line 96:
</syntaxhighlight>
</syntaxhighlight>


=Idiomatic Struct Naming Conventions=
=<span id='Idiomatic_Struct_Naming_Conventions'></span>Naming=
Since references include package names, variable names should not be prefixed with the package name. If the package name is <code>xml</code>, use the name <code>Reader</code>, not <code>XMLReader</code>. In the second case, the full name would be <code>xml.XMLReader</code>.
Struct names should follow the general Go naming conventions:
Also see: {{Internal|Go_Style#Naming|Go Style}}
{{Internal|Go_Style#Naming|Go Naming}}


=Literal=
=<span id='Literal'></span>Composite Literal=
Struct literals can have all field initialization values on the same line, or on different lines:
Struct composite literals are expressions that create a new instance of the structure every time they are evaluated. The fields of a composite literal must be laid out in order and must be all present:
<syntaxhighlight lang='go'>
i := Item{"blue", 5}
</syntaxhighlight>
 
However, by labeling the elements explicitly as <code>field:value</code> pairs, the initializers can appear in any order, with the missing ones left as their respective zero values:
<syntaxhighlight lang='go'>
i := Item{size: 5}
</syntaxhighlight>
 
Composite literals can have all field initialization values on the same line, or on different lines:


<syntaxhighlight lang='go'>
<syntaxhighlight lang='go'>
Line 173: Line 124:
</syntaxhighlight>
</syntaxhighlight>


There is "short" literal where the name of the fields are omitted, provided that the order in which they are declared is maintained:
There is "short" literal where the name of the fields are omitted, given that values for '''all''' fields are provided, the order in which they are declared is maintained:


<syntaxhighlight lang='go'>
<syntaxhighlight lang='go'>
i := Item {"blue", 5}
i := Item {"blue", 5}
</syntaxhighlight>
</syntaxhighlight>
It is possible to initialize just some of the fields, but in this case the field names must be provided. The remaining fields will be initialized to their zero value:
<syntaxhighlight lang='go'>
i := Item {color:"blue"}
</syntaxhighlight>
==Embedded Literals==
==Embedded Literals==
When a field of a struct is another struct, embedded literals can be used in initialization:
When a field of a struct is another struct, embedded literals can be used in initialization:
Line 192: Line 149:


=Initialization=
=Initialization=
<span id='new'></span>Initialize a struct variable with an empty struct using the <code>[[Go_Built-In_Function_new|new()]]</code> [[Go_Functions#Built-in_Functions|built-in function]] and the [[Go_Language#Short_Variable_Declaration|short variable declaration]]:
<span id='new'></span>Initialize a struct variable with an empty struct using the <code>[[Go_Functions#new()|new()]]</code> [[Go_Functions#Built-in_Functions|built-in function]] and the [[Go_Language#Short_Variable_Declaration|short variable declaration]]:
<syntaxhighlight lang='go'>
<syntaxhighlight lang='go'>
i := new(Item)
i := new(Item)
Line 199: Line 156:
This results in an "empty" structure, with all fields initialized to zero. Note <code>new()</code> returns a pointer to the structure and not the structure itself.
This results in an "empty" structure, with all fields initialized to zero. Note <code>new()</code> returns a pointer to the structure and not the structure itself.


A [[#Literal|struct literal]] can also be used:
A [[#Literal|struct composite literal]] can also be used to initialize the structure:
<syntaxhighlight lang='go'>
<syntaxhighlight lang='go'>
i := Item{color: "blue", size: 5}
i := Item{color: "blue", size: 5}
</syntaxhighlight>
</syntaxhighlight>
==Make Zero Value Useful==
It is good practice to design the type to be usable with its zero values, right away, without additional initialization, if possible. This means a user of the data structure can create one with <code>new()</code> and get it right to work. For example, the documentation for <code>bytes.Buffer</code> states that "the zero value for <code>Buffer</code> is an empty buffer ready to use". Similarly, <code>sync.Mutex</code> does not have an explicit constructor or <code>Init()</code> method. Instead, the zero value for <code>sync.Mutex</code> is defined to be an unlocked mutex. The zero-value-is-useful property works transitively, if a structure is composed by two types that works while initialized with zero value, the composite structure works while initialized with zero value.
==Initializing Struct Fields to Something Else than Zero Value==
==Initializing Struct Fields to Something Else than Zero Value==
<font color=darkkhaki>It seems not to be possible. If I have a Set structure that internally contains a map, I have two options:
<font color=darkkhaki>It seems not to be possible. If I have a Set structure that internally contains a map, I have two options:
Line 212: Line 174:


=Operators=
=Operators=
==The Dot Operator==
==<span id='The_Dot_Operator'></span>The Selector Operator==
Individual fields in a struct can be read and modified using the "dot notation", the <code>[[Go_Concepts_-_Operators#.|.]]</code> operator.  
Individual fields in a struct can be read and modified using the "dot notation", the <code>[[Go_Concepts_-_Operators#.|.]]</code> [[Go_Language#Selector_Operator|selector operator]].  
<syntaxhighlight lang='go'>
<syntaxhighlight lang='go'>
var i Item
var i Item
Line 222: Line 184:
</syntaxhighlight>
</syntaxhighlight>


Note that the <code>.</code> operator works with a regular struct variable as well as with a pointer to the struct. The compiler knows how to compensate for both of these cases:
<span id='Selector_Operator_Versatility'></span>Note that the <code>.</code> operator works with a regular struct variable as well as with a pointer to the struct. The compiler takes care of the underlying details to access the value and knows how to compensate for both of these cases:


<syntaxhighlight lang='go'>
<syntaxhighlight lang='go'>
i := new(Item)
type SomeStruct struct {
fmt.Printf("i: %p\n", i)   // i is a pointer
  i int
i.color = "blue"            // the dot operator is applied to a pointer
}
i2 := Item{"red", 1}
 
fmt.Printf("i2: %+v\n", i2) // i2 is a struct
s := SomeStruct{10}
i2.color = "yellow"        // the dot operator is applied to a struct
s2 := &s
 
fmt.Printf("%d\n", s.i)   // the selector operator is applied to a value 
fmt.Printf("%d\n", s2.i) // the selector operator is applied to a pointer
</syntaxhighlight>
</syntaxhighlight>
=Exporting Structs=
=Exporting Structs=
<font size='darkkhaki>"Exporting Structs" section needs refactoring.</font>
<font size='darkkhaki>"Exporting Structs" section needs refactoring.</font>

Latest revision as of 02:10, 4 November 2024

External

Internal

Overview

Structs are user-defined composite types, grouping together instances of arbitrary types, into one object.

A struct declaration consists in a sequence of named elements, called fields. Each field has one or more names and a type. The name(s) can be explicitly listed in the structure, or can be implicit, for embedded fields. A field may optionally have a tag. More details about fields can be found in the Fields section.

A struct variable is a value, not a reference variable, which means that no two different struct variables may point to the same struct instance.

Structs can represent data values that could have either a primitive or non-primitive value. "Primitive" structs imply creating a new instance every time an existing value is mutated. In this case, values themselves, rather than pointers, are used to share values of those structs. An example is the Time structure in the time package. However, in most cases, structs exhibit a non-primitive structure: adding or removing something from the value mutates the value, does not create a new value.

Printing Structs

To print the abbreviated struct, and only display values, and not field names, use:

fmt.Println(s)

To print both field names and values, use the %+v format specifier:

fmt.Printf("%+v\n", s)

Declaration

Struct types can be declared at package level or inside a function.

The struct type definition is introduced by the type keyword, to indicate that this is a user-defined type, followed by the new type name and the keyword struct, followed by a curly-brackets-enclosed enumeration of fields.

Once the struct type has been defined in a package or a function, variables of that type can be declared using the long declaration:

var i Item

If no explicit initialization follows, the struct variables declared as above have all their fields initialized with [[Go_Language#Zero_Value|zero values]. Short declaration can also be used, as shown in the Initialization section, below.

Fields

Each field has one or more names and a type. The type can be a pre-declared type or a user-defined type, such as another struct or an interface. No comma is required at the end of the field line. The name(s) can be explicitly listed in the structure, or can be implicit, in case of embedded fields. A field may optionally have a tag. Within a struct, non-blank field names must be unique.

A named field usually represents a has-a relationship, consistent with the struct's composite type nature, and an embedded field represents an is-a relationship (see below).

Fields can be exported outside the package declaring the type. For more details see the Exporting Fields section.

These are named fields:

type Item struct {
    color string
    size int
}

A field with multiple names can be declared as such:

type Item struct {
    a, b, c int
}

We say that fields with the same types can be collapsed. This structure is fully equivalent with the structure where each of the names is declared on its own line:

type Item struct {
    a int
    b int
    c int
}

Embedded Fields

Embedded fields, embedded field promotion, promoted fields, embedded field identity, overriding embedded fields are discussed in the following article:

Embedded Fields

Blank Fields

type Item struct {
	_ float32 // a blank field
    _ int // another blank field
}

Tags

A field declaration may be followed by an optional string literal tag, which is a string literal declared between `...` (backticks). A tag becomes an attribute for all the fields in the corresponding field declaration. The tags are made visible through a reflection interface and take part in type identity for structs but are otherwise ignored.

type A struct {
    Name string `json:"name"`
}

In the example above, a tag has been included to provide metadata the JSON decoding function needs to parse JSON content. Each tag maps a field name in the struct to a filed name in the JSON document. Tags are useful in JSON and YAML serialization/deserialization.

struct Zero Value

Expand this, struct zero value is important when parsing YAML and entire subtrees are missing.

Empty struct

An empty struct allocates zero bytes when values of this type are created. They are useful when a type, but not state, is needed. Example:

// we declared an empty struct that defines the type "A"
type A struct{}

a := A{}
b := struct{}{}

Naming

Struct names should follow the general Go naming conventions:

Go Naming

Composite Literal

Struct composite literals are expressions that create a new instance of the structure every time they are evaluated. The fields of a composite literal must be laid out in order and must be all present:

i := Item{"blue", 5}

However, by labeling the elements explicitly as field:value pairs, the initializers can appear in any order, with the missing ones left as their respective zero values:

i := Item{size: 5}

Composite literals can have all field initialization values on the same line, or on different lines:

i := Item{color: "blue", size: 5}
i := Item {
  color: "blue",
  size: 5, // mandatory comma
}

There is "short" literal where the name of the fields are omitted, given that values for all fields are provided, the order in which they are declared is maintained:

i := Item {"blue", 5}

It is possible to initialize just some of the fields, but in this case the field names must be provided. The remaining fields will be initialized to their zero value:

i := Item {color:"blue"}

Embedded Literals

When a field of a struct is another struct, embedded literals can be used in initialization:

i := Item {
  color: "blue",
  packaging: {
    material: "plastic",
    protectionLevel: 10,
  }
  size: 5,
}

Initialization

Initialize a struct variable with an empty struct using the new() built-in function and the short variable declaration:

i := new(Item)

This results in an "empty" structure, with all fields initialized to zero. Note new() returns a pointer to the structure and not the structure itself.

A struct composite literal can also be used to initialize the structure:

i := Item{color: "blue", size: 5}

Make Zero Value Useful

It is good practice to design the type to be usable with its zero values, right away, without additional initialization, if possible. This means a user of the data structure can create one with new() and get it right to work. For example, the documentation for bytes.Buffer states that "the zero value for Buffer is an empty buffer ready to use". Similarly, sync.Mutex does not have an explicit constructor or Init() method. Instead, the zero value for sync.Mutex is defined to be an unlocked mutex. The zero-value-is-useful property works transitively, if a structure is composed by two types that works while initialized with zero value, the composite structure works while initialized with zero value.

Initializing Struct Fields to Something Else than Zero Value

It seems not to be possible. If I have a Set structure that internally contains a map, I have two options:

1. Use a NewSet() constructor that initializes the map.

2. Do lazy initialization on operations.

Operators

The Selector Operator

Individual fields in a struct can be read and modified using the "dot notation", the . selector operator.

var i Item
i.color = "blue"
i.size = 2

fmt.Printf("color %s, size %d\b", i.color, i.size)

Note that the . operator works with a regular struct variable as well as with a pointer to the struct. The compiler takes care of the underlying details to access the value and knows how to compensate for both of these cases:

type SomeStruct struct {
  i int
}

s := SomeStruct{10}
s2 := &s

fmt.Printf("%d\n", s.i)   // the selector operator is applied to a value  
fmt.Printf("%d\n", s2.i)  // the selector operator is applied to a pointer

Exporting Structs

"Exporting Structs" section needs refactoring.

Even if the enclosing struct type is exported by a package, not all fields are exported automatically, only those whose first character is an upper case letter. This behavior makes possible to have "private" fields in a public structure, when the structure is used outside its package.

Encapsulation and Private Fields

Formally define the semantics of fields that start with lower case names. They are "hidden". What exactly does that mean? Apparently, other package cannot see them.

Aslo see:

Object Oriented Programming in Go | Encapsulation

Exporting Fields

The fields of an exported struct type can be exported or unexported on a field-by-field basis, by naming them with an uppercase and respectively lowercase letter.

What if the struct is named with a lowercase letter and the field starts with an uppercase letter?

Explain this behavior: YAML_in_Go#TODO_Ee4.

Exporting Embedded Fields

If the name of an embedded field starts with a lower case, it is unexported even if its outer type is exported. Even if the name of the inner type is not exported, its fields may be individually exported, and thus accessible from outside the package.

Structs as Receiver Types

Object Oriented Programming in Go | Structs as Receiver Types