Propagating edits to chapter 10 back

This commit is contained in:
Carol (Nichols || Goulding)
2022-03-22 21:13:37 -04:00
committed by Carol (Nichols || Goulding)
parent 79c42ede54
commit 3fa81cda9f
4 changed files with 280 additions and 286 deletions

View File

@@ -1,42 +1,46 @@
# Generic Types, Traits, and Lifetimes
Every programming language has tools for effectively handling the duplication
of concepts. In Rust, one such tool is *generics*. Generics are abstract
stand-ins for concrete types or other properties. When were writing code, we
can express the behavior of generics or how they relate to other generics
without knowing what will be in their place when compiling and running the code.
of concepts. In Rust, one such tool is *generics*: abstract stand-ins for
concrete types or other properties. We can express the behavior of generics or
how they relate to other generics without knowing what will be in their place
when compiling and running the code.
Similar to the way a function takes parameters with unknown values to run the
same code on multiple concrete values, functions can take parameters of some
generic type instead of a concrete type, like `i32` or `String`. In fact, weve
Functions can take parameters of some generic type, instead of a concrete type
like `i32` or `String`, in the same way a function takes parameters with
unknown values to run the same code on multiple concrete values. In fact, weve
already used generics in Chapter 6 with `Option<T>`, Chapter 8 with `Vec<T>`
and `HashMap<K, V>`, and Chapter 9 with `Result<T, E>`. In this chapter, youll
explore how to define your own types, functions, and methods with generics!
First, well review how to extract a function to reduce code duplication. Next,
well use the same technique to make a generic function from two functions that
First, well review how to extract a function to reduce code duplication. Well
then use the same technique to make a generic function from two functions that
differ only in the types of their parameters. Well also explain how to use
generic types in struct and enum definitions.
Then youll learn how to use *traits* to define behavior in a generic way. You
can combine traits with generic types to constrain a generic type to only
those types that have a particular behavior, as opposed to just any type.
can combine traits with generic types to constrain a generic type to accept
only those types that have a particular behavior, as opposed to just any type.
Finally, well discuss *lifetimes*, a variety of generics that give the
Finally, well discuss *lifetimes*: a variety of generics that give the
compiler information about how references relate to each other. Lifetimes allow
us to borrow values in many situations while still enabling the compiler to
check that the references are valid.
us to give the compiler enough information about borrowed values so that it can
ensure references will be valid in more situations than it could without our
help.
## Removing Duplication by Extracting a Function
Before diving into generics syntax, lets first look at how to remove
duplication that doesnt involve generic types by extracting a function. Then
well apply this technique to extract a generic function! In the same way that
you recognize duplicated code to extract into a function, youll start to
recognize duplicated code that can use generics.
Generics allow us to replace specific types with a placeholder that represents
multiple types to remove code duplication. Before diving into generics syntax,
then, lets first look at how to remove duplication in a way that doesnt
involve generic types by extracting a function that replaces specific values
with a placeholder that represents multiple values. Then well apply the same
technique to extract a generic function! By looking at how to recognize
duplicated code you can extract into a function, youll start to recognize
duplicated code that can use generics.
Consider a short program that finds the largest number in a list, as shown in
Listing 10-1.
We begin with the short program in Listing 10-1 that finds the largest number
in a list.
<span class="filename">Filename: src/main.rs</span>
@@ -44,21 +48,21 @@ Listing 10-1.
{{#rustdoc_include ../listings/ch10-generic-types-traits-and-lifetimes/listing-10-01/src/main.rs:here}}
```
<span class="caption">Listing 10-1: Code to find the largest number in a list
of numbers</span>
<span class="caption">Listing 10-1: Finding the largest number in a list of
numbers</span>
This code stores a list of integers in the variable `number_list` and places
the first number in the list in a variable named `largest`. Then it iterates
through all the numbers in the list, and if the current number is greater than
the number stored in `largest`, it replaces the number in that variable.
However, if the current number is less than or equal to the largest number seen
so far, the variable doesnt change, and the code moves on to the next number
in the list. After considering all the numbers in the list, `largest` should
hold the largest number, which in this case is 100.
We store a list of integers in the variable `number_list` and place the first
number in the list in a variable named `largest`. We then iterate through all
the numbers in the list, and if the current number is greater than the number
stored in `largest`, replace the number in that variable. However, if the
current number is less than or equal to the largest number seen so far, the
variable doesnt change, and the code moves on to the next number in the list.
After considering all the numbers in the list, `largest` should hold the
largest number, which in this case is 100.
To find the largest number in two different lists of numbers, we can duplicate
the code in Listing 10-1 and use the same logic at two different places in the
program, as shown in Listing 10-2.
We've now been tasked with finding the largest number in two different lists of
numbers. To do so, we can choose to duplicate the code in Listing 10-1 and use
the same logic at two different places in the program, as shown in Listing 10-2.
<span class="filename">Filename: src/main.rs</span>
@@ -70,17 +74,18 @@ program, as shown in Listing 10-2.
lists of numbers</span>
Although this code works, duplicating code is tedious and error prone. We also
have to update the code in multiple places when we want to change it.
have to remember to update the code in multiple places when we want to change
it.
To eliminate this duplication, we can create an abstraction by defining a
function that operates on any list of integers given to it in a parameter. This
To eliminate this duplication, well create an abstraction by defining a
function that operates on any list of integers passed in a parameter. This
solution makes our code clearer and lets us express the concept of finding the
largest number in a list abstractly.
In Listing 10-3, we extracted the code that finds the largest number into a
function named `largest`. Unlike the code in Listing 10-1, which can find the
largest number in only one particular list, this program can find the largest
number in two different lists.
In Listing 10-3, we extract the code that finds the largest number into a
function named `largest`. Then we call the function to find the largest number
in the two lists from Listing 10-2. We could also use the function on any other
list of `i32` values we might have in the future.
<span class="filename">Filename: src/main.rs</span>
@@ -92,9 +97,9 @@ number in two different lists.
in two lists</span>
The `largest` function has a parameter called `list`, which represents any
concrete slice of `i32` values that we might pass into the function. As a
result, when we call the function, the code runs on the specific values that we
pass in. Dont worry about the syntax of the `for` loop for now. We arent
concrete slice of `i32` values we might pass into the function. As a result,
when we call the function, the code runs on the specific values that we pass
in. Dont worry about the syntax of the `for` loop for now. We arent
referencing a reference to an `i32` here; were pattern matching and
destructuring each `&i32` that the `for` loop gets so that `item` will be an
`i32` inside the loop body. Well cover pattern matching in detail in [Chapter
@@ -108,10 +113,9 @@ Listing 10-3:
inputs and return values of that code in the function signature.
3. Update the two instances of duplicated code to call the function instead.
Next, well use these same steps with generics to reduce code duplication in
different ways. In the same way that the function body can operate on an
abstract `list` instead of specific values, generics allow code to operate on
abstract types.
Next, well use these same steps with generics to reduce code duplication. In
the same way that the function body can operate on an abstract `list` instead
of specific values, generics allow code to operate on abstract types.
For example, say we had two functions: one that finds the largest item in a
slice of `i32` values and one that finds the largest item in a slice of `char`

View File

@@ -1,6 +1,6 @@
## Generic Data Types
We can use generics to create definitions for items like function signatures or
We use generics to create definitions for items like function signatures or
structs, which we can then use with many different concrete data types. Lets
first look at how to define functions, structs, enums, and methods using
generics. Then well discuss how generics affect code performance.
@@ -13,7 +13,8 @@ parameters and return value. Doing so makes our code more flexible and provides
more functionality to callers of our function while preventing code duplication.
Continuing with our `largest` function, Listing 10-4 shows two functions that
both find the largest value in a slice.
both find the largest value in a slice. We'll then combine these into a single
function that uses generics.
<span class="filename">Filename: src/main.rs</span>
@@ -29,9 +30,9 @@ the largest `i32` in a slice. The `largest_char` function finds the largest
`char` in a slice. The function bodies have the same code, so lets eliminate
the duplication by introducing a generic type parameter in a single function.
To parameterize the types in the new function well define, we need to name the
type parameter, just as we do for the value parameters to a function. You can
use any identifier as a type parameter name. But well use `T` because, by
To parameterize the types in a new single function, we need to name the type
parameter, just as we do for the value parameters to a function. You can use
any identifier as a type parameter name. But well use `T` because, by
convention, parameter names in Rust are short, often just a letter, and Rusts
type-naming convention is CamelCase. Short for “type,” `T` is the default
choice of most Rust programmers.
@@ -63,8 +64,8 @@ compile yet, but well fix it later in this chapter.
{{#rustdoc_include ../listings/ch10-generic-types-traits-and-lifetimes/listing-10-05/src/main.rs}}
```
<span class="caption">Listing 10-5: A definition of the `largest` function that
uses generic type parameters but doesnt compile yet</span>
<span class="caption">Listing 10-5: The `largest` function using generic type
parameters; this doesnt yet compile</span>
If we compile this code right now, well get this error:
@@ -73,21 +74,22 @@ If we compile this code right now, well get this error:
```
The note mentions `std::cmp::PartialOrd`, which is a *trait*. Well talk about
traits in the next section. For now, this error states that the body of
`largest` wont work for all possible types that `T` could be. Because we want
to compare values of type `T` in the body, we can only use types whose values
can be ordered. To enable comparisons, the standard library has the
traits in the next section. For now, know that this error states that the body
of `largest` wont work for all possible types that `T` could be. Because we
want to compare values of type `T` in the body, we can only use types whose
values can be ordered. To enable comparisons, the standard library has the
`std::cmp::PartialOrd` trait that you can implement on types (see Appendix C
for more on this trait). Youll learn how to specify that a generic type has a
particular trait in the [“Traits as Parameters”][traits-as-parameters]<!--
ignore --> section, but lets first explore other ways of using generic type
parameters.
ignore --> section. Before we fix this code (in the section [“Fixing the
`largest` Function with Trait Bounds”][fixing]<!-- ignore -->), lets first
explore other ways of using generic type parameters.
### In Struct Definitions
We can also define structs to use a generic type parameter in one or more
fields using the `<>` syntax. Listing 10-6 shows how to define a `Point<T>`
struct to hold `x` and `y` coordinate values of any type.
fields using the `<>` syntax. Listing 10-6 defines a `Point<T>` struct to hold
`x` and `y` coordinate values of any type.
<span class="filename">Filename: src/main.rs</span>
@@ -100,9 +102,8 @@ values of type `T`</span>
The syntax for using generics in struct definitions is similar to that used in
function definitions. First, we declare the name of the type parameter inside
angle brackets just after the name of the struct. Then we can use the generic
type in the struct definition where we would otherwise specify concrete data
types.
angle brackets just after the name of the struct. Then we use the generic type
in the struct definition where we would otherwise specify concrete data types.
Note that because weve used only one generic type to define `Point<T>`, this
definition says that the `Point<T>` struct is generic over some type `T`, and
@@ -119,8 +120,11 @@ Listing 10-7, our code wont compile.
<span class="caption">Listing 10-7: The fields `x` and `y` must be the same
type because both have the same generic data type `T`.</span>
In this example, when we assign the integer value 5 to `x`, we let the
compiler know that the generic type `T` will be an integer for this instance of
Listing 10-7: The fields `x` and `y` must be the same type because both have
the same generic data type `T`.
In this example, when we assign the integer value 5 to `x`, we let the compiler
know that the generic type `T` will be an integer for this instance of
`Point<T>`. Then when we specify 4.0 for `y`, which weve defined to have the
same type as `x`, well get a type mismatch error like this:
@@ -130,8 +134,8 @@ same type as `x`, well get a type mismatch error like this:
To define a `Point` struct where `x` and `y` are both generics but could have
different types, we can use multiple generic type parameters. For example, in
Listing 10-8, we can change the definition of `Point` to be generic over types
`T` and `U` where `x` is of type `T` and `y` is of type `U`.
Listing 10-8, we change the definition of `Point` to be generic over types `T`
and `U` where `x` is of type `T` and `y` is of type `U`.
<span class="filename">Filename: src/main.rs</span>
@@ -144,8 +148,9 @@ that `x` and `y` can be values of different types</span>
Now all the instances of `Point` shown are allowed! You can use as many generic
type parameters in a definition as you want, but using more than a few makes
your code hard to read. When you need lots of generic types in your code, it
could indicate that your code needs restructuring into smaller pieces.
your code hard to read. If you're finding you need lots of generic types in
your code, it could indicate that your code needs restructuring into smaller
pieces.
### In Enum Definitions
@@ -160,10 +165,10 @@ enum Option<T> {
}
```
This definition should now make more sense to you. As you can see, `Option<T>`
is an enum that is generic over type `T` and has two variants: `Some`, which
This definition should now make more sense to you. As you can see, the
`Option<T>` enum is generic over type `T` and has two variants: `Some`, which
holds one value of type `T`, and a `None` variant that doesnt hold any value.
By using the `Option<T>` enum, we can express the abstract concept of having an
By using the `Option<T>` enum, we can express the abstract concept of an
optional value, and because `Option<T>` is generic, we can use this abstraction
no matter what the type of the optional value is.
@@ -209,21 +214,20 @@ struct we defined in Listing 10-6 with a method named `x` implemented on it.
Here, weve defined a method named `x` on `Point<T>` that returns a reference
to the data in the field `x`.
Note that we have to declare `T` just after `impl` so we can use it to specify
Note that we have to declare `T` just after `impl` so we can use `T` to specify
that were implementing methods on the type `Point<T>`. By declaring `T` as a
generic type after `impl`, Rust can identify that the type in the angle
brackets in `Point` is a generic type rather than a concrete type. Because this
is declaring the generic again, we could have chosen a different name for the
generic parameter than the generic parameter declared in the struct definition,
but using the same name is conventional. Methods written within an `impl` that
declares the generic type will be defined on any instance of the type, no
matter what concrete type ends up substituting for the generic type.
brackets in `Point` is a generic type rather than a concrete type. We could
have chosen a different name for this generic parameter than the generic
parameter declared in the struct definition, but using the same name is
conventional. Methods written within an `impl` that declares the generic type
will be defined on any instance of the type, no matter what concrete type ends
up substituting for the generic type.
The other option we have is defining methods on the type with some constraint
on the generic type. We could, for example, implement methods only on
`Point<f32>` instances rather than on `Point<T>` instances with any generic
type. In Listing 10-10 we use the concrete type `f32`, meaning we dont declare
any types after `impl`.
We can also specify constraints on generic types when defining methods on the
type. We could, for example, implement methods only on `Point<f32>` instances
rather than on `Point<T>` instances with any generic type. In Listing 10-10 we
use the concrete type `f32`, meaning we dont declare any types after `impl`.
<span class="filename">Filename: src/main.rs</span>
@@ -234,14 +238,14 @@ any types after `impl`.
<span class="caption">Listing 10-10: An `impl` block that only applies to a
struct with a particular concrete type for the generic type parameter `T`</span>
This code means the type `Point<f32>` will have a method named
`distance_from_origin` and other instances of `Point<T>` where `T` is not of
type `f32` will not have this method defined. The method measures how far our
point is from the point at coordinates (0.0, 0.0) and uses mathematical
operations that are available only for floating point types.
This code means the type `Point<f32>` will have a `distance_from_origin`
method; other instances of `Point<T>` where `T` is not of type `f32` will not
have this method defined. The method measures how far our point is from the
point at coordinates (0.0, 0.0) and uses mathematical operations that are
available only for floating point types.
Generic type parameters in a struct definition arent always the same as those
you use in that structs method signatures. Listing 10-11 uses the generic
you use in that same structs method signatures. Listing 10-11 uses the generic
types `X1` and `Y1` for the `Point` struct and `X2` `Y2` for the `mixup` method
signature to make the example clearer. The method creates a new `Point`
instance with the `x` value from the `self` `Point` (of type `X1`) and the `y`
@@ -253,7 +257,7 @@ value from the passed-in `Point` (of type `Y2`).
{{#rustdoc_include ../listings/ch10-generic-types-traits-and-lifetimes/listing-10-11/src/main.rs}}
```
<span class="caption">Listing 10-11: A method that uses different generic types
<span class="caption">Listing 10-11: A method that uses generic types different
from its structs definition</span>
In `main`, weve defined a `Point` that has an `i32` for `x` (with value `5`)
@@ -273,22 +277,19 @@ method.
### Performance of Code Using Generics
You might be wondering whether there is a runtime cost when youre using
generic type parameters. The good news is that Rust implements generics in such
a way that your code doesnt run any slower using generic types than it would
with concrete types.
You might be wondering whether there is a runtime cost when using generic type
parameters. The good news is that using generic types won't make your run any
slower than it would with concrete types.
Rust accomplishes this by performing monomorphization of the code that is using
Rust accomplishes this by performing monomorphization of the code using
generics at compile time. *Monomorphization* is the process of turning generic
code into specific code by filling in the concrete types that are used when
compiled.
compiled. In this process, the compiler does the opposite of the steps we used
to create the generic function in Listing 10-5: the compiler looks at all the
places where generic code is called and generates code for the concrete types
the generic code is called with.
In this process, the compiler does the opposite of the steps we used to create
the generic function in Listing 10-5: the compiler looks at all the places
where generic code is called and generates code for the concrete types the
generic code is called with.
Lets look at how this works with an example that uses the standard librarys
Lets look at how this works by using the standard librarys generic
`Option<T>` enum:
```rust
@@ -303,8 +304,7 @@ is `f64`. As such, it expands the generic definition of `Option<T>` into
`Option_i32` and `Option_f64`, thereby replacing the generic definition with
the specific ones.
The monomorphized version of the code looks like the following. The generic
`Option<T>` is replaced with the specific definitions created by the compiler:
The monomorphized version of the code looks like the following:
<span class="filename">Filename: src/main.rs</span>
@@ -325,10 +325,12 @@ fn main() {
}
```
Because Rust compiles generic code into code that specifies the type in each
instance, we pay no runtime cost for using generics. When the code runs, it
performs just as it would if we had duplicated each definition by hand. The
process of monomorphization makes Rusts generics extremely efficient at
runtime.
The generic `Option<T>` is replaced with the specific definitions created by
the compiler. Because Rust compiles generic code into code that specifies the
type in each instance, we pay no runtime cost for using generics. When the code
runs, it performs just as it would if we had duplicated each definition by
hand. The process of monomorphization makes Rusts generics extremely efficient
at runtime.
[traits-as-parameters]: ch10-02-traits.html#traits-as-parameters
[fixing]: ch10-02-traits.html#fixing-the-largest-function-with-trait-bounds

View File

@@ -1,9 +1,9 @@
## Traits: Defining Shared Behavior
A *trait* tells the Rust compiler about functionality a particular type has and
can share with other types. We can use traits to define shared behavior in an
abstract way. We can use trait bounds to specify that a generic type can be any
type that has certain behavior.
A *trait* defines functionality a particular type has and can share with other
types. We can use traits to define shared behavior in an abstract way. We can
use *trait bounds* to specify that a generic type can be any type that has
certain behavior.
> Note: Traits are similar to a feature often called *interfaces* in other
> languages, although with some differences.
@@ -72,20 +72,19 @@ already limited to 280 characters.
`NewsArticle` and `Tweet` types</span>
Implementing a trait on a type is similar to implementing regular methods. The
difference is that after `impl`, we put the trait name that we want to
implement, then use the `for` keyword, and then specify the name of the type we
want to implement the trait for. Within the `impl` block, we put the method
signatures that the trait definition has defined. Instead of adding a semicolon
after each signature, we use curly brackets and fill in the method body with
the specific behavior that we want the methods of the trait to have for the
particular type.
difference is that after `impl`, we put the trait name we want to implement,
then use the `for` keyword, and then specify the name of the type we want to
implement the trait for. Within the `impl` block, we put the method signatures
that the trait definition has defined. Instead of adding a semicolon after each
signature, we use curly brackets and fill in the method body with the specific
behavior that we want the methods of the trait to have for the particular type.
Now that the library has implemented the `Summary` trait on `NewsArticle` and
`Tweet`, users of the crate can call the trait methods on instances of
`NewsArticle` and `Tweet` in the same way we call regular methods. The only
difference is that the trait has to be brought into scope as well as the types
to get the additional trait methods. Heres an example of how a binary crate
could use our `aggregator` library crate:
difference is that the user must bring the trait into scope as well as the
types. Heres an example of how a binary crate could use our `aggregator`
library crate:
```rust,ignore
{{#rustdoc_include ../listings/ch10-generic-types-traits-and-lifetimes/no-listing-01-calling-trait-method/src/main.rs}}
@@ -95,22 +94,22 @@ This code prints `1 new tweet: horse_ebooks: of course, as you probably already
know, people`.
Other crates that depend on the `aggregator` crate can also bring the `Summary`
trait into scope to implement the trait on their own types. One restriction to
note with trait implementations is that we can implement a trait on a type only
if at least one of the trait or the type is local to our crate. For example, we
can implement standard library traits like `Display` on a custom type like
`Tweet` as part of our `aggregator` crate functionality, because the type
`Tweet` is local to our `aggregator` crate. We can also implement `Summary` on
`Vec<T>` in our `aggregator` crate, because the trait `Summary` is local to our
`aggregator` crate.
trait into scope to implement `Summary` on their own types. One restriction to
note is that we can implement a trait on a type only if at least one of the
trait or the type is local to our crate. For example, we can implement standard
library traits like `Display` on a custom type like `Tweet` as part of our
`aggregator` crate functionality, because the type `Tweet` is local to our
`aggregator` crate. We can also implement `Summary` on `Vec<T>` in our
`aggregator` crate, because the trait `Summary` is local to our `aggregator`
crate.
But we cant implement external traits on external types. For example, we cant
implement the `Display` trait on `Vec<T>` within our `aggregator` crate,
because `Display` and `Vec<T>` are defined in the standard library and arent
local to our `aggregator` crate. This restriction is part of a property of
programs called *coherence*, and more specifically the *orphan rule*, so named
because the parent type is not present. This rule ensures that other peoples
code cant break your code and vice versa. Without the rule, two crates could
because `Display` and `Vec<T>` are both defined in the standard library and
arent local to our `aggregator` crate. This restriction is part of a property
called *coherence*, and more specifically the *orphan rule*, so named because
the parent type is not present. This rule ensures that other peoples code
cant break your code and vice versa. Without the rule, two crates could
implement the same trait for the same type, and Rust wouldnt know which
implementation to use.
@@ -121,9 +120,9 @@ in a trait instead of requiring implementations for all methods on every type.
Then, as we implement the trait on a particular type, we can keep or override
each methods default behavior.
Listing 10-14 shows how to specify a default string for the `summarize` method
of the `Summary` trait instead of only defining the method signature, as we did
in Listing 10-12.
In Listing 10-14 we specify a default string for the `summarize` method of the
`Summary` trait instead of only defining the method signature, as we did in
Listing 10-12.
<span class="filename">Filename: src/lib.rs</span>
@@ -131,12 +130,11 @@ in Listing 10-12.
{{#rustdoc_include ../listings/ch10-generic-types-traits-and-lifetimes/listing-10-14/src/lib.rs:here}}
```
<span class="caption">Listing 10-14: Definition of a `Summary` trait with a
default implementation of the `summarize` method</span>
<span class="caption">Listing 10-14: Defining a `Summary` trait with a default
implementation of the `summarize` method</span>
To use a default implementation to summarize instances of `NewsArticle` instead
of defining a custom implementation, we specify an empty `impl` block with
`impl Summary for NewsArticle {}`.
To use a default implementation to summarize instances of `NewsArticle`, we
specify an empty `impl` block with `impl Summary for NewsArticle {}`.
Even though were no longer defining the `summarize` method on `NewsArticle`
directly, weve provided a default implementation and specified that
@@ -149,11 +147,10 @@ the `summarize` method on an instance of `NewsArticle`, like this:
This code prints `New article available! (Read more...)`.
Creating a default implementation for `summarize` doesnt require us to change
anything about the implementation of `Summary` on `Tweet` in Listing 10-13. The
reason is that the syntax for overriding a default implementation is the same
as the syntax for implementing a trait method that doesnt have a default
implementation.
Creating a default implementation doesnt require us to change anything about
the implementation of `Summary` on `Tweet` in Listing 10-13. The reason is that
the syntax for overriding a default implementation is the same as the syntax
for implementing a trait method that doesnt have a default implementation.
Default implementations can call other methods in the same trait, even if those
other methods dont have a default implementation. In this way, a trait can
@@ -192,13 +189,11 @@ overriding implementation of that same method.
### Traits as Parameters
Now that you know how to define and implement traits, we can explore how to use
traits to define functions that accept many different types.
For example, in Listing 10-13, we implemented the `Summary` trait on the
`NewsArticle` and `Tweet` types. We can define a `notify` function that calls
the `summarize` method on its `item` parameter, which is of some type that
implements the `Summary` trait. To do this, we can use the `impl Trait`
syntax, like this:
traits to define functions that accept many different types. We'll use the
`Summary` trait we implemented on the `NewsArticle` and `Tweet` types in
Listing 10-13 to define a `notify` function that calls the `summarize` method
on its `item` parameter, which is of some type that implements the `Summary`
trait. To do this, we use the `impl Trait` syntax, like this:
```rust,ignore
{{#rustdoc_include ../listings/ch10-generic-types-traits-and-lifetimes/no-listing-04-traits-as-parameters/src/lib.rs:here}}
@@ -214,9 +209,8 @@ because those types dont implement `Summary`.
#### Trait Bound Syntax
The `impl Trait` syntax works for straightforward cases but is actually
syntax sugar for a longer form, which is called a *trait bound*; it looks like
this:
The `impl Trait` syntax works for straightforward cases but is actually syntax
sugar for a longer form known as a *trait bound*; it looks like this:
```rust,ignore
pub fn notify<T: Summary>(item: &T) {
@@ -229,18 +223,18 @@ more verbose. We place trait bounds with the declaration of the generic type
parameter after a colon and inside angle brackets.
The `impl Trait` syntax is convenient and makes for more concise code in simple
cases. The trait bound syntax can express more complexity in other cases. For
example, we can have two parameters that implement `Summary`. Using the `impl
Trait` syntax looks like this:
cases, while the fuller trait bound syntax can express more complexity in other
cases. For example, we can have two parameters that implement `Summary`. Doing
so with the `impl Trait` syntax looks like this:
```rust,ignore
pub fn notify(item1: &impl Summary, item2: &impl Summary) {
```
If we wanted this function to allow `item1` and `item2` to have different
types, using `impl Trait` would be appropriate (as long as both types implement
`Summary`). If we wanted to force both parameters to have the same type, thats
only possible to express using a trait bound, like this:
Using `impl Trait` is appropriate if we want this function to allow `item1` and
`item2` to have different types (as long as both types implement `Summary`). If
we want to force both parameters to have the same type, however, we must use a
trait bound, like this:
```rust,ignore
pub fn notify<T: Summary>(item1: &T, item2: &T) {
@@ -253,9 +247,9 @@ passed as an argument for `item1` and `item2` must be the same.
#### Specifying Multiple Trait Bounds with the `+` Syntax
We can also specify more than one trait bound. Say we wanted `notify` to use
display formatting on `item` as well as the `summarize` method: we specify in
the `notify` definition that `item` must implement both `Display` and
`Summary`. We can do so using the `+` syntax:
display formatting as well as `summarize` on `item`: we specify in the `notify`
definition that `item` must implement both `Display` and `Summary`. We can do
so using the `+` syntax:
```rust,ignore
pub fn notify(item: &(impl Summary + Display)) {
@@ -308,14 +302,14 @@ value of some type that implements a trait, as shown here:
By using `impl Summary` for the return type, we specify that the
`returns_summarizable` function returns some type that implements the `Summary`
trait without naming the concrete type. In this case, `returns_summarizable`
returns a `Tweet`, but the code calling this function doesnt know that.
returns a `Tweet`, but the code calling this function doesnt need to know that.
The ability to return a type that is only specified by the trait it implements
is especially useful in the context of closures and iterators, which we cover
in Chapter 13. Closures and iterators create types that only the compiler knows
or types that are very long to specify. The `impl Trait` syntax lets you
concisely specify that a function returns some type that implements the
`Iterator` trait without needing to write out a very long type.
The ability to specify a return type only by the trait it implements is
especially useful in the context of closures and iterators, which we cover in
Chapter 13. Closures and iterators create types that only the compiler knows or
types that are very long to specify. The `impl Trait` syntax lets you concisely
specify that a function returns some type that implements the `Iterator` trait
without needing to write out a very long type.
However, you can only use `impl Trait` if youre returning a single type. For
example, this code that returns either a `NewsArticle` or a `Tweet` with the
@@ -334,10 +328,10 @@ ignore --> section of Chapter 17.
### Fixing the `largest` Function with Trait Bounds
Now that you know how to specify the behavior you want to use using the generic
type parameters bounds, lets return to Listing 10-5 to fix the definition of
the `largest` function that uses a generic type parameter! Last time we tried
to run that code, we received this error:
Now that you know how to specify the behavior you want using the generic type
parameters bounds, lets return to Listing 10-5 to fix the definition of the
`largest` function that uses a generic type parameter! Last time we tried to
run that code, we received this error:
```console
{{#include ../listings/ch10-generic-types-traits-and-lifetimes/listing-10-05/output.txt}}
@@ -366,11 +360,10 @@ With our non-generic versions of the `largest` function, we were only trying to
find the largest `i32` or `char`. As discussed in the [“Stack-Only Data:
Copy”][stack-only-data-copy]<!-- ignore --> section in Chapter 4, types like
`i32` and `char` that have a known size can be stored on the stack, so they
implement the `Copy` trait. But when we made the `largest` function generic,
it became possible for the `list` parameter to have types in it that dont
implement the `Copy` trait. Consequently, we wouldnt be able to move the
value out of `list[0]` and into the `largest` variable, resulting in this
error.
implement the `Copy` trait. But when we made the `largest` function generic, it
became possible for the `list` parameter to have types in it that dont
implement the `Copy` trait. Consequently, we wouldnt be able to move the value
out of `list[0]` and into the `largest` variable, resulting in this error.
To call this code with only those types that implement the `Copy` trait, we can
add `Copy` to the trait bounds of `T`! Listing 10-15 shows the complete code of
@@ -396,14 +389,14 @@ potentially making more heap allocations in the case of types that own heap
data like `String`, and heap allocations can be slow if were working with
large amounts of data.
Another way we could implement `largest` is for the function to return a
reference to a `T` value in the slice. If we change the return type to `&T`
instead of `T`, thereby changing the body of the function to return a
reference, we wouldnt need the `Clone` or `Copy` trait bounds and we could
avoid heap allocations. Try implementing these alternate solutions on your own!
If you get stuck with errors having to do with lifetimes, keep reading: the
“Validating References with Lifetimes” section coming up will explain, but
lifetimes arent required to solve these challenges.
We could also implement `largest` by having the function return a reference to
a `T` value in the slice. If we change the return type to `&T` instead of `T`,
thereby changing the body of the function to return a reference, we wouldnt
need the `Clone` or `Copy` trait bounds and we could avoid heap allocations.
Try implementing these alternate solutions on your own! If you get stuck with
errors having to do with lifetimes, keep reading: the “Validating References
with Lifetimes” section coming up will explain, but lifetimes arent required
to solve these challenges.
### Using Trait Bounds to Conditionally Implement Methods
@@ -411,7 +404,7 @@ By using a trait bound with an `impl` block that uses generic type parameters,
we can implement methods conditionally for types that implement the specified
traits. For example, the type `Pair<T>` in Listing 10-16 always implements the
`new` function to return a new instance of `Pair<T>` (recall from the
[Defining Methods”][methods]<!-- ignore --> section of Chapter 5 that `Self`
[Defining Methods”][methods]<!-- ignore --> section of Chapter 5 that `Self`
is a type alias for the type of the `impl` block, which in this case is
`Pair<T>`). But in the next `impl` block, `Pair<T>` only implements the
`cmp_display` method if its inner type `T` implements the `PartialOrd` trait
@@ -463,13 +456,6 @@ checks for behavior at runtime because weve already checked at compile time.
Doing so improves performance without having to give up the flexibility of
generics.
Another kind of generic that weve already been using is called *lifetimes*.
Rather than ensuring that a type has the behavior we want, lifetimes ensure
that references are valid as long as we need them to be. Lets look at how
lifetimes do that.
[stack-only-data-copy]:
ch04-01-what-is-ownership.html#stack-only-data-copy
[using-trait-objects-that-allow-for-values-of-different-types]:
ch17-02-trait-objects.html#using-trait-objects-that-allow-for-values-of-different-types
[stack-only-data-copy]: ch04-01-what-is-ownership.html#stack-only-data-copy
[using-trait-objects-that-allow-for-values-of-different-types]: ch17-02-trait-objects.html#using-trait-objects-that-allow-for-values-of-different-types
[methods]: ch05-03-method-syntax.html#defining-methods

View File

@@ -1,24 +1,27 @@
## Validating References with Lifetimes
Lifetimes are another kind of generic that weve already been using. Rather
than ensuring that a type has the behavior we want, lifetimes ensure that
references are valid as long as we need them to be.
One detail we didnt discuss in the [“References and
Borrowing”][references-and-borrowing]<!-- ignore --> section in Chapter 4 is
that every reference in Rust has a *lifetime*, which is the scope for which
that reference is valid. Most of the time, lifetimes are implicit and
inferred, just like most of the time, types are inferred. We must annotate
types when multiple types are possible. In a similar way, we must annotate
lifetimes when the lifetimes of references could be related in a few different
ways. Rust requires us to annotate the relationships using generic lifetime
parameters to ensure the actual references used at runtime will definitely be
valid.
that reference is valid. Most of the time, lifetimes are implicit and inferred,
just like most of the time, types are inferred. We only must annotate types
when multiple types are possible. In a similar way, we must annotate lifetimes
when the lifetimes of references could be related in a few different ways. Rust
requires us to annotate the relationships using generic lifetime parameters to
ensure the actual references used at runtime will definitely be valid.
Annotating lifetimes is not even a concept most other programming languages
have, so this is going to feel unfamiliar. Although we wont cover lifetimes in
their entirety in this chapter, well discuss common ways you might encounter
lifetime syntax so you can get introduced to the concept.
lifetime syntax so you can get comfortable with the concept.
### Preventing Dangling References with Lifetimes
The main aim of lifetimes is to prevent dangling references, which cause a
The main aim of lifetimes is to prevent *dangling references*, which cause a
program to reference data other than the data its intended to reference.
Consider the program in Listing 10-17, which has an outer scope and an inner
scope.
@@ -96,10 +99,10 @@ lifetimes of parameters and return values in the context of functions.
### Generic Lifetimes in Functions
Lets write a function that returns the longer of two string slices. This
function will take two string slices and return a string slice. After weve
implemented the `longest` function, the code in Listing 10-20 should print `The
longest string is abcd`.
Well write a function that returns the longer of two string slices. This
function will take two string slices and return a single string slice. After
weve implemented the `longest` function, the code in Listing 10-20 should
print `The longest string is abcd`.
<span class="filename">Filename: src/main.rs</span>
@@ -111,8 +114,8 @@ longest string is abcd`.
function to find the longer of two string slices</span>
Note that we want the function to take string slices, which are references,
because we dont want the `longest` function to take ownership of its
parameters. Refer to the [“String Slices as
rather than strings, because we dont want the `longest` function to take
ownership of its parameters. Refer to the [“String Slices as
Parameters”][string-slices-as-parameters]<!-- ignore --> section in Chapter 4
for more discussion about why the parameters we use in Listing 10-20 are the
ones we want.
@@ -155,18 +158,17 @@ perform its analysis.
### Lifetime Annotation Syntax
Lifetime annotations dont change how long any of the references live. Just
as functions can accept any type when the signature specifies a generic type
parameter, functions can accept references with any lifetime by specifying a
generic lifetime parameter. Lifetime annotations describe the relationships of
the lifetimes of multiple references to each other without affecting the
lifetimes.
Lifetime annotations dont change how long any of the references live. Rather,
they describe the relationships of the lifetimes of multiple references to each
other without affecting the lifetimes. Just as functions can accept any type
when the signature specifies a generic type parameter, functions can accept
references with any lifetime by specifying a generic lifetime parameter.
Lifetime annotations have a slightly unusual syntax: the names of lifetime
parameters must start with an apostrophe (`'`) and are usually all lowercase and
very short, like generic types. Most people use the name `'a`. We place
lifetime parameter annotations after the `&` of a reference, using a space to
separate the annotation from the references type.
parameters must start with an apostrophe (`'`) and are usually all lowercase
and very short, like generic types. Most people use the name `'a` for the first
lifetime annotation. We place lifetime parameter annotations after the `&` of a
reference, using a space to separate the annotation from the references type.
Here are some examples: a reference to an `i32` without a lifetime parameter, a
reference to an `i32` that has a lifetime parameter named `'a`, and a mutable
@@ -192,11 +194,11 @@ lifetime.
Now lets examine lifetime annotations in the context of the `longest`
function. As with generic type parameters, we need to declare generic lifetime
parameters inside angle brackets between the function name and the parameter
list. The constraint we want to express in this signature is that the lifetimes
of both of the parameters and the lifetime of the returned reference are
related such that the returned reference will be valid as long as both the
parameters are. Well name the lifetime `'a` and then add it to each reference,
as shown in Listing 10-22.
list. We want the signature to express the following constraint: the returned
reference will be valid as long as both the parameters are valid. This is the
relationship between lifetimes of the parameters and the return value. Well
name the lifetime `'a` and then add it to each reference, as shown in Listing
10-22.
<span class="filename">Filename: src/main.rs</span>
@@ -229,7 +231,7 @@ substituted for `'a` that will satisfy this signature.
When annotating lifetimes in functions, the annotations go in the function
signature, not in the function body. The lifetime annotations become part of
the contract of the function, much like the types in the signature are. Having
the contract of the function, much like the types in the signature. Having
function signatures contain the lifetime contract means the analysis the Rust
compiler does can be simpler. If theres a problem with the way a function is
annotated or the way it is called, the compiler errors can point to the part of
@@ -262,16 +264,16 @@ references to `String` values that have different concrete lifetimes</span>
In this example, `string1` is valid until the end of the outer scope, `string2`
is valid until the end of the inner scope, and `result` references something
that is valid until the end of the inner scope. Run this code, and youll see
that the borrow checker approves of this code; it will compile and print `The
longest string is long string is long`.
that the borrow checker approves; it will compile and print `The longest string
is long string is long`.
Next, lets try an example that shows that the lifetime of the reference in
`result` must be the smaller lifetime of the two arguments. Well move the
declaration of the `result` variable outside the inner scope but leave the
assignment of the value to the `result` variable inside the scope with
`string2`. Then well move the `println!` that uses `result` outside the inner
scope, after the inner scope has ended. The code in Listing 10-24 will not
compile.
`string2`. Then well move the `println!` that uses `result` to outside the
inner scope, after the inner scope has ended. The code in Listing 10-24 will
not compile.
<span class="filename">Filename: src/main.rs</span>
@@ -282,7 +284,7 @@ compile.
<span class="caption">Listing 10-24: Attempting to use `result` after `string2`
has gone out of scope</span>
When we try to compile this code, well get this error:
When we try to compile this code, we get this error:
```console
{{#include ../listings/ch10-generic-types-traits-and-lifetimes/listing-10-24/output.txt}}
@@ -321,16 +323,17 @@ following code will compile:
{{#rustdoc_include ../listings/ch10-generic-types-traits-and-lifetimes/no-listing-08-only-one-reference-with-lifetime/src/main.rs:here}}
```
In this example, weve specified a lifetime parameter `'a` for the parameter
`x` and the return type, but not for the parameter `y`, because the lifetime of
`y` does not have any relationship with the lifetime of `x` or the return value.
Weve specified a lifetime parameter `'a` for the parameter `x` and the return
type, but not for the parameter `y`, because the lifetime of `y` does not have
any relationship with the lifetime of `x` or the return value.
When returning a reference from a function, the lifetime parameter for the
return type needs to match the lifetime parameter for one of the parameters. If
the reference returned does *not* refer to one of the parameters, it must refer
to a value created within this function, which would be a dangling reference
because the value will go out of scope at the end of the function. Consider
this attempted implementation of the `longest` function that wont compile:
to a value created within this function. However, this would be a dangling
reference because the value will go out of scope at the end of the function.
Consider this attempted implementation of the `longest` function that wont
compile:
<span class="filename">Filename: src/main.rs</span>
@@ -362,10 +365,10 @@ would create dangling pointers or otherwise violate memory safety.
### Lifetime Annotations in Struct Definitions
So far, weve only defined structs to hold owned types. Its possible for
structs to hold references, but in that case we would need to add a lifetime
annotation on every reference in the structs definition. Listing 10-25 has a
struct named `ImportantExcerpt` that holds a string slice.
So far, the structs we've define all hold owned types. We can define structs to
hold references, but in that case we would need to add a lifetime annotation on
every reference in the structs definition. Listing 10-25 has a struct named
`ImportantExcerpt` that holds a string slice.
<span class="filename">Filename: src/main.rs</span>
@@ -394,8 +397,8 @@ the `ImportantExcerpt` goes out of scope, so the reference in the
Youve learned that every reference has a lifetime and that you need to specify
lifetime parameters for functions or structs that use references. However, in
Chapter 4 we had a function in Listing 4-9, which is shown again in Listing
10-26, that compiled without lifetime annotations.
Chapter 4 we had a function in Listing 4-9, shown again in Listing 10-26, that
compiled without lifetime annotations.
<span class="filename">Filename: src/lib.rs</span>
@@ -435,39 +438,38 @@ fits these cases, you dont need to write the lifetimes explicitly.
The elision rules dont provide full inference. If Rust deterministically
applies the rules but there is still ambiguity as to what lifetimes the
references have, the compiler wont guess what the lifetime of the remaining
references should be. In this case, instead of guessing, the compiler will give
you an error that you can resolve by adding the lifetime annotations that
specify how the references relate to each other.
references should be. Instead of guessing, the compiler will give you an error
that you can resolve by adding the lifetime annotations.
Lifetimes on function or method parameters are called *input lifetimes*, and
lifetimes on return values are called *output lifetimes*.
The compiler uses three rules to figure out what lifetimes references have when
there arent explicit annotations. The first rule applies to input lifetimes,
and the second and third rules apply to output lifetimes. If the compiler gets
to the end of the three rules and there are still references for which it cant
figure out lifetimes, the compiler will stop with an error. These rules apply
to `fn` definitions as well as `impl` blocks.
The compiler uses three rules to figure out the lifetimes of the references
when there arent explicit annotations. The first rule applies to input
lifetimes, and the second and third rules apply to output lifetimes. If the
compiler gets to the end of the three rules and there are still references for
which it cant figure out lifetimes, the compiler will stop with an error.
These rules apply to `fn` definitions as well as `impl` blocks.
The first rule is that each parameter that is a reference gets its own lifetime
parameter. In other words, a function with one parameter gets one lifetime
parameter: `fn foo<'a>(x: &'a i32)`; a function with two parameters gets two
separate lifetime parameters: `fn foo<'a, 'b>(x: &'a i32, y: &'b i32)`; and so
on.
The first rule is that the compiler assigns a lifetime parameter to each
parameter thats a reference. In other words, a function with one parameter
gets one lifetime parameter: `fn foo<'a>(x: &'a i32)`; a function with two
parameters gets two separate lifetime parameters: `fn foo<'a, 'b>(x: &'a i32,
y: &'b i32)`; and so on.
The second rule is if there is exactly one input lifetime parameter, that
The second rule is that, if there is exactly one input lifetime parameter, that
lifetime is assigned to all output lifetime parameters: `fn foo<'a>(x: &'a i32)
-> &'a i32`.
The third rule is if there are multiple input lifetime parameters, but one of
them is `&self` or `&mut self` because this is a method, the lifetime of `self`
is assigned to all output lifetime parameters. This third rule makes methods
much nicer to read and write because fewer symbols are necessary.
The third rule is that, if there are multiple input lifetime parameters, but
one of them is `&self` or `&mut self` because this is a method, the lifetime of
`self` is assigned to all output lifetime parameters. This third rule makes
methods much nicer to read and write because fewer symbols are necessary.
Lets pretend were the compiler. Well apply these rules to figure out what
the lifetimes of the references in the signature of the `first_word` function
in Listing 10-26 are. The signature starts without any lifetimes associated
with the references:
Lets pretend were the compiler. Well apply these rules to figure out the
lifetimes of the references in the signature of the `first_word` function in
Listing 10-26. The signature starts without any lifetimes associated with the
references:
```rust,ignore
fn first_word(s: &str) -> &str {
@@ -560,9 +562,9 @@ and all lifetimes have been accounted for.
### The Static Lifetime
One special lifetime we need to discuss is `'static`, which means that this
reference *can* live for the entire duration of the program. All string
literals have the `'static` lifetime, which we can annotate as follows:
One special lifetime we need to discuss is `'static`, which denotes that the
affected reference *can* live for the entire duration of the program. All
string literals have the `'static` lifetime, which we can annotate as follows:
```rust
let s: &'static str = "I have a static lifetime.";
@@ -575,10 +577,10 @@ is always available. Therefore, the lifetime of all string literals is
You might see suggestions to use the `'static` lifetime in error messages. But
before specifying `'static` as the lifetime for a reference, think about
whether the reference you have actually lives the entire lifetime of your
program or not. You might consider whether you want it to live that long, even
if it could. Most of the time, the problem results from attempting to create a
dangling reference or a mismatch of the available lifetimes. In such cases, the
solution is fixing those problems, not specifying the `'static` lifetime.
program or not, and whether you want it to. Most of the time, an error message
suggesting the `'static` lifetime results from attempting to create a dangling
reference or a mismatch of the available lifetimes. In such cases, the solution
is fixing those problems, not specifying the `'static` lifetime.
## Generic Type Parameters, Trait Bounds, and Lifetimes Together