
Zig (programming language)

From Wikipedia, the free encyclopedia
Zig
Paradigms: Multi-paradigm: imperative, concurrent, procedural, functional
Designed by: Andrew Kelley
First appeared: 8 February 2016[1]
Preview release: 0.13.0[2] / 6 June 2024
Typing discipline: Static, strong, inferred, structural, generic
Memory management: Manual
Platform: x86-64, ARM64, WebAssembly
Tier 2: ARM, IA-32, RISC-V, MIPS64, PowerPC64, SPARC64 (some tier-2 platforms have tier-1 support for standalone programs)
OS: Cross-platform: Linux, FreeBSD, Windows
License: MIT
Filename extensions: .zig, .zir, .zigr, .zon
Website: ziglang.org
Influenced by: C, C++, LLVM IR, Go, Rust, JavaScript[citation needed]

Zig is an imperative, general-purpose, statically typed, compiled systems programming language designed by Andrew Kelley.[3] It is intended as a successor to C, aiming to be even smaller and simpler to program in while offering more functionality.[4] It is free and open-source software, released under an MIT License.

The improvements in language simplicity relate to flow control, function calls, library imports, variable declaration and Unicode support. Further, the language makes no use of macros or preprocessor instructions. Features adopted from modern languages include compile-time generic types, which allow functions to work on a variety of data, along with a small set of new compiler directives that give access to information about those types through reflective programming (reflection).

Another set of additions to Zig is intended to improve code safety. Like C, Zig omits garbage collection, and memory management is manual. To help eliminate the potential errors that arise in such systems, it includes option types, a simple syntax for using them, and a unit testing framework built into the language.

Description


Goals


The primary goal of Zig is to be a better solution to the sorts of tasks that are currently solved with C. A primary concern in that respect is readability; Zig attempts to use existing concepts and syntax wherever possible, avoiding the addition of different syntax for similar concepts. Further, it is designed for "robustness, optimality and maintainability", including a variety of features to improve safety, optimization, and testing. The small and simple syntax is an important part of the maintenance, as it is a goal of the language to allow maintainers to debug the code without having to learn the intricacies of a language they might not be familiar with.[5] Even with these changes, Zig can compile into and against existing C code; C headers can be included in a Zig project and their functions called, and Zig code can be linked into C projects by including the compiler-built headers.[6]

In keeping with the overall design philosophy of making the code simple and easy to read, the Zig system as a whole also encompasses a number of stylistic changes compared to C and other C-like languages. For instance, the Rust language has operator overloading, which means a statement like a = b + c might actually be a function call to a type's overloaded version of the plus operator. Further, that function might panic, which might pre-empt any following code. In Zig, if something calls a function, it looks like a function call; if it doesn't, it doesn't look like a function call. If it can raise an error, that is explicit in the syntax:[6] errors are represented by error types and are dealt with using catch or try.
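The explicit error syntax described above can be sketched as follows. This is a minimal illustration rather than code from the Zig documentation: the error set ParseError and the functions parseDigit and sumDigits are hypothetical names, and the sketch assumes a recent Zig release such as 0.13.

```zig
const std = @import("std");

// Hypothetical error set used only for this sketch.
const ParseError = error{ Empty, NotADigit };

// Parses a single decimal digit. The `!` in the return type marks the
// function as one that can fail with a ParseError.
fn parseDigit(text: []const u8) ParseError!u8 {
    if (text.len == 0) return error.Empty;
    const c = text[0];
    if (c < '0' or c > '9') return error.NotADigit;
    return c - '0';
}

// `try` passes any error from parseDigit straight back to the caller.
fn sumDigits(a: []const u8, b: []const u8) ParseError!u8 {
    return (try parseDigit(a)) + (try parseDigit(b));
}

pub fn main() void {
    // `catch` handles the error at the call site with a fallback value.
    const ok = sumDigits("3", "4") catch 0;
    const bad = sumDigits("3", "x") catch 0;
    std.debug.print("{d} {d}\n", .{ ok, bad }); // prints "7 0"
}
```

Every point at which an error can propagate is marked with try or catch, so the possible exits from sumDigits are visible in the source text.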

The goals of Zig are in contrast to those of many other languages designed in the same time period, like Go, Rust, Carbon, and Nim. Generally, these languages are more complex, with added features like operator overloading, functions that masquerade as values (properties), and many other features intended to aid in building large programs. These sorts of features have more in common with C++'s approach, and these languages are more along the lines of that language.[6] Zig has a more conservative extension of the type system, supporting compile-time generics and accommodating a form of duck typing with the comptime keyword.

Memory handling


One of the primary sources of bugs in C programs is the memory management system, based on malloc. malloc sets aside a block of memory for use in the code and returns a reference to that memory as a pointer. There is no system to ensure that memory is released when the program no longer needs it, which can lead to programs using up all available memory, a memory leak. More common is a dangling pointer that does not refer to a properly allocated memory object.[7]

A common solution to these problems is a garbage collector (GC), which examines the program for pointers to previously allocated memory and removes any blocks that no longer have anything pointing to them. Although this greatly reduces, or even eliminates, memory errors, GC systems are relatively slow compared to manual memory management[citation needed] and have unpredictable performance that makes them unsuited to systems programming. Another solution is automatic reference counting (ARC), which implements the same basic concept of identifying blocks of disused memory but does so at pointer creation and destruction time by maintaining a count of the pointers to each block. This removes the need for exhaustive pointer searches, at the cost of adding counter-adjustment overhead to every pointer creation and destruction operation.[7]

Zig aims to provide performance similar to or better than C, so GC and ARC are not suitable solutions. Instead, it uses a concept known as option types, considered modern as of 2022. Rather than allowing a pointer to point to nothing (null), a separate type is used to indicate data that is optionally empty. This is similar to using a structure with a pointer and a boolean that indicates whether the pointer is valid, but the state of the boolean is managed invisibly by the language and does not need to be explicitly managed by the programmer. So, for instance, when the pointer is declared it is set to "unallocated", and when that pointer receives a value from a malloc, it is set to "allocated" if the malloc succeeded.[8]

The advantage to this model is that it has very low or zero overhead; the compiler has to create the code to pass along the optional type when pointers are manipulated, as opposed to a simple pointer, but this allows it to directly express possible memory problems at compile time with no runtime support. For instance, creating a pointer with a null value and then attempting to use it is perfectly acceptable in C, leading to null-pointer errors. In contrast, a language using optional types can check that all code paths only attempt to use pointers when they are valid. While this does not eliminate all potential problems, when issues do occur at runtime the error can be more precisely located and explained.[9]
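A small sketch of an optional pointer may make this concrete. The Node type and the find function are hypothetical names chosen for illustration, and the code assumes a recent Zig release such as 0.13.

```zig
const std = @import("std");

// Hypothetical singly linked node; `next` may legitimately be absent.
const Node = struct {
    value: u32,
    next: ?*const Node = null,
};

// Returns a pointer to the first node holding `wanted`, or null.
fn find(head: ?*const Node, wanted: u32) ?*const Node {
    var it = head;
    // The loop body only runs while `it` holds a non-null pointer.
    while (it) |node| : (it = node.next) {
        if (node.value == wanted) return node;
    }
    return null;
}

pub fn main() void {
    const tail = Node{ .value = 2 };
    const head = Node{ .value = 1, .next = &tail };

    // The optional must be unwrapped before use; accessing `.value`
    // on the optional itself is rejected by the compiler.
    if (find(&head, 2)) |found| {
        std.debug.print("found {d}\n", .{found.value});
    }
    if (find(&head, 9) == null) {
        std.debug.print("9 is absent\n", .{});
    }
}
```

Because the return type is ?*const Node rather than *const Node, a caller cannot dereference the result without first handling the null case.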

Another change for memory management in Zig is that the actual allocation is handled through structs describing the action, as opposed to calling the memory management functions in libc. For instance, in C if one wants to write a function that makes a string containing multiple copies of another string, the function might look like this:

const char* repeat(const char* original, size_t times);

In the code, the function would examine the size of original and then malloc times that length to set aside memory for the string it will build. That malloc is invisible to the functions calling it; if they fail to later release the memory, a leak will occur. In Zig, this might be handled using a function like:

fn repeat(allocator: std.mem.Allocator, original: []const u8, times: usize) std.mem.Allocator.Error![]const u8;

In this code, the allocator variable is passed a struct that describes what code should perform the allocation, and the repeat function returns either the resulting string or, using the error union type indicated by the !, an Allocator.Error. By directly expressing the allocator as an input, memory allocation is never "hidden" within another function; it is always exposed to the API by the function that is ultimately calling for the memory to be allocated. Functions in Zig's standard library do not allocate implicitly; those that need memory take an allocator as a parameter. Further, as the struct can point to anything, one can use alternative allocators, even ones written in the program. This can allow, for instance, small-object allocators that do not use the operating system functions that normally allocate an entire memory page.[10]

Optional types are an example of a language feature that offers general functionality while still being simple and generic. They do not have to be used to solve null pointer problems; they are also useful for any type of value where "no value" is an appropriate answer. Consider a function countTheNumberOfUsers that returns an integer, and an integer variable, theCountedUsers, that holds the result. In many languages, a magic number would be placed in theCountedUsers to indicate that countTheNumberOfUsers has not yet been called, while many implementations would just set it to zero. In Zig, this could be implemented as var theCountedUsers: ?i32 = null, which sets the variable to a clear "not been called" value.[10]
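The counting example can be sketched as below. The body of countTheNumberOfUsers and the helper describe are assumptions made for illustration; only the names countTheNumberOfUsers and theCountedUsers come from the text above. The sketch assumes a recent Zig release such as 0.13.

```zig
const std = @import("std");

// Hypothetical stand-in for the function named in the text.
fn countTheNumberOfUsers() i32 {
    return 42;
}

// Distinguishes "not counted yet" from a genuine count of zero.
fn describe(count: ?i32) []const u8 {
    if (count) |n| {
        return if (n == 0) "counted: none" else "counted: some";
    }
    return "not counted yet";
}

pub fn main() void {
    var theCountedUsers: ?i32 = null;
    std.debug.print("{s}\n", .{describe(theCountedUsers)});

    theCountedUsers = countTheNumberOfUsers();
    std.debug.print("{s}\n", .{describe(theCountedUsers)});

    // `orelse` supplies a default when the optional is null.
    const shown: i32 = theCountedUsers orelse -1;
    std.debug.print("{d}\n", .{shown});
}
```

No magic number is needed: null and 0 are different values of the type ?i32, so the two states cannot be confused.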

Another more general feature of Zig that also helps manage memory problems is the concept of defer, which marks some code to be performed when the enclosing scope exits, whichever path is taken, including when the function returns an error. If a particular function allocates some memory and then disposes of it when the operation is complete, one can add a line to defer a free to ensure it is released no matter how the function returns.[10]
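A minimal sketch of deferring a free follows. The function sumUpTo is a hypothetical example, and the code assumes a recent Zig release such as 0.13.

```zig
const std = @import("std");

// Returns 1 + 2 + ... + n, using a temporary heap buffer purely to
// demonstrate allocation and release.
fn sumUpTo(allocator: std.mem.Allocator, n: usize) !usize {
    const scratch = try allocator.alloc(usize, n);
    // Runs when this scope exits, on success and on error returns alike.
    defer allocator.free(scratch);

    for (scratch, 0..) |*slot, i| slot.* = i + 1;

    var total: usize = 0;
    for (scratch) |v| total += v;
    return total;
}

pub fn main() !void {
    const total = try sumUpTo(std.heap.page_allocator, 10);
    std.debug.print("{d}\n", .{total}); // prints "55"
}
```

Placing the defer directly after the allocation keeps the acquisition and the release next to each other in the source, even though the release happens last.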

Zig memory management avoids hidden allocations. Allocation is not managed in the language directly. Instead, heap access is done via the standard library, explicitly.[11]

Direct interaction with C


Zig promotes an evolutionary approach to using the language that combines new Zig code with existing C code. To do this, it aims to make interaction with existing C libraries as seamless as possible. Zig imports its own libraries with the @import directive, typically in this fashion:

const std = @import("std");

Zig code within that file can now call functions inside std, for instance:

std.debug.print("Hello, world!\n", .{});

To work with C code, one simply replaces the @import with @cImport:

const c = @cImport(@cInclude("soundio/soundio.h"));

The Zig code can now call functions in the soundio library as if they were native Zig code. As Zig uses new data types that are explicitly defined, unlike C's more generic int and float, a small number of directives are used to move data between the C and Zig types, including @intCast and @ptrCast.[10]

Cross compiling


Zig treats cross-compiling as a first-class use case of the language. This means any Zig compiler can compile runnable binaries for any of its target platforms, of which there are dozens. These include not only widely used modern systems like ARM and x86-64, but also PowerPC, SPARC, MIPS, RISC-V and even the IBM z/Architecture (S390). The toolchain can compile to any of these targets without installing additional software; all the needed support is in the basic system.[10]

Comptime


By using the comptime keyword, the programmer can explicitly have Zig evaluate sections of code at compile time, as opposed to runtime. Being able to run code at compile time allows Zig to have the functionality of macros and conditional compilation without the need for a separate preprocessor language.[12]

During compile time, types become first-class citizens. This enables compile-time duck typing, and is how Zig implements generic types.[13]
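As a rough sketch of both ideas, the following evaluates an ordinary function at compile time and uses a type as a parameter. The names factorial, table_len and maxOf are hypothetical, and the code assumes a recent Zig release such as 0.13.

```zig
const std = @import("std");

// An ordinary function; nothing marks it as compile-time only.
fn factorial(n: u64) u64 {
    var result: u64 = 1;
    var i: u64 = 2;
    while (i <= n) : (i += 1) result *= i;
    return result;
}

// Container-level constants are evaluated by the compiler, so this
// value can be used where a compile-time constant is required.
const table_len = factorial(4);

// `T` is a type passed as a compile-time argument. Any type that
// supports `>` is accepted (compile-time duck typing).
fn maxOf(comptime T: type, a: T, b: T) T {
    return if (a > b) a else b;
}

pub fn main() void {
    // `comptime` forces evaluation of the call during compilation.
    const big = comptime factorial(10);
    std.debug.print("{d} {d} {d}\n", .{
        @sizeOf([table_len]u8), // array length must be comptime-known
        big,
        maxOf(i32, -3, 7),
    }); // prints "24 3628800 7"
}
```

Calling maxOf with a type that has no ordering, such as a struct, fails at compile time rather than at runtime.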

For instance, in Zig, a generic linked list type might be implemented using a function like:

fn LinkedList(comptime T: type) type;

This function takes in some type T, and returns a custom struct defining a linked list with that data type.

Origin of the name


The name 'Zig' was reportedly chosen through a process involving a Python script that randomly combined letters, starting with the letter 'Z' and followed by a vowel or 'Y', in order to generate four-letter words. Despite the intended length, 'Zig', a three-letter word, was ultimately selected from the various combinations produced by the script.[14]

Other features


Zig supports compile time generics, reflection, and evaluation, cross-compiling, and manual memory management.[15] A major goal of the language is to improve on the C language,[12][16] while also taking inspiration from Rust,[17][6] among others. Zig has many features for low-level programming, notably packed structs (structs without padding between fields), arbitrary-width integers[18] and multiple pointer types.[13]
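Packed structs and arbitrary-width integers can be illustrated with a short sketch. The Flags layout below is hypothetical and does not describe any real hardware register; the code assumes a recent Zig release such as 0.13.

```zig
const std = @import("std");

// Hypothetical 8-bit flag layout. Fields are packed with no padding,
// starting from the least significant bit.
const Flags = packed struct {
    enabled: bool, // 1 bit
    mode: u3, // 3-bit unsigned integer
    level: u4, // 4-bit unsigned integer
};

pub fn main() void {
    const f = Flags{ .enabled = true, .mode = 5, .level = 9 };
    // Reinterpret the packed struct as its 8-bit backing integer.
    const raw: u8 = @bitCast(f);
    std.debug.print("bits={d} raw=0x{x}\n", .{ @bitSizeOf(Flags), raw });
    // prints "bits=8 raw=0x9b"
}
```

The integer types u3 and u4 are checked like any other integer type, so assigning a value that does not fit, such as 8 to a u3 field, is a compile error for constants.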

Zig is not just a new language: it also includes a C/C++ compiler, and can be used with either or both languages.

Drawbacks


Zig has several drawbacks. If memory is not properly deallocated, memory leaks can result, because the language has no hidden control flow that would release it automatically.[19] The learning curve for Zig can be steep, especially for those unfamiliar with low-level programming concepts.[19] Although Zig has a growing community, as of 2024 it remains a new language with areas for improvement in maturity, ecosystem and tooling.[19] Interoperability with other languages may pose challenges, as it often requires extra effort to manage data marshaling and communication.[19] Finally, the availability of learning resources is limited for complex use cases, though this is gradually improving as interest and adoption increase.[19]

Versions


Since version 0.10, the default Zig compiler has been written in Zig itself, making it a self-hosting compiler; this was a major new feature of that release. The older bootstrapping compiler, written in C++, remained available as an option in 0.10 but was due to be dropped in version 0.11. The new compiler uses much less memory and compiles somewhat faster; the legacy C++-based compiler uses 3.5 times as much memory.

Zig's default backend for optimization is still LLVM,[20] which is written in C++. The Zig compiler is 169 MiB with LLVM and 4.4 MiB without it. The self-hosted compiler usually produces faster executable code, has better LLVM code generation, and fixes many bugs, although version 0.10 also includes improvements to the legacy compiler. The self-hosted linker is tightly coupled with the self-hosted compiler. Version 0.10 also adds experimental (tier-3) support for AMD GPUs, along with more limited support for Nvidia GPUs and for PlayStation 4 and 5.

The older bootstrapping ("stage1") compiler is written in Zig and C++, using LLVM 13 as a back-end,[21][22] supporting many of its native targets.[23] The compiler is free and open-source software released under an MIT License.[24] The Zig compiler exposes the ability to compile C and C++ similarly to Clang with the commands zig cc and zig c++,[25] providing many headers including the C standard library (libc) and C++ Standard Library (libcxx) for many different platforms, allowing Zig's cc and c++ sub-commands to act as cross compilers out of the box.[26][27]

In addition to the officially supported and documented operating systems, which are mostly desktop ones, minimal applications can be and have been made for Android (with the Android NDK), and programming for iOS is also possible.

Before version 0.11.0, Zig did not have a package manager, but in 0.11.0 an experimental version was released which was further extended in version 0.12.0. There is no official package repository; instead a package is simply a URL pointing to a compressed file that, when uncompressed, includes a standard build.zig file (that the Zig compiler uses by convention to compile the source code) and, ideally, a build.zig.zon file which is used to define the name and version of the package.

Zig development is funded by the Zig Software Foundation (ZSF), a non-profit corporation with Andrew Kelley as president, which accepts donations and hires multiple full-time employees.[28][29][30]

Examples


Hello World

const std = @import("std");

pub fn main() !void {
    const stdout = std.io.getStdOut().writer();
    try stdout.print("Hello, {s}!\n", .{"world"});
}

Generic linked list

const std = @import("std");

fn LinkedList(comptime T: type) type {
    return struct {
        const Self = @This();
        pub const Node = struct {
            next: ?*Node = null,
            data: T,
        };

        first: ?*Node = null,

        // Pushes new_node onto the front of the list.
        pub fn prepend(
            list: *Self,
            new_node: *Node,
        ) void {
            new_node.next = list.first;
            list.first = new_node;
        }

        // Custom formatter used by std.fmt when the list is printed.
        pub fn format(
            list: Self,
            comptime fmt: []const u8,
            options: std.fmt.FormatOptions,
            out_stream: anytype,
        ) !void {
            try out_stream.writeAll("( ");
            var it = list.first;
            while (it) |node| : (it = node.next) {
                try std.fmt.formatType(
                    node.data,
                    fmt,
                    options,
                    out_stream,
                    1,
                );
                try out_stream.writeAll(" ");
            }
            try out_stream.writeAll(")");
        }
    };
}

pub fn main() !void {
    // Obtained at runtime: a container-level constant would be
    // evaluated at compile time, which fails on some targets
    // (for example Windows).
    const stdout = std.io.getStdOut().writer();

    const ListU32 = LinkedList(u32);
    var list = ListU32{};
    var node1 = ListU32.Node{ .data = 1 };
    var node2 = ListU32.Node{ .data = 2 };
    var node3 = ListU32.Node{ .data = 3 };
    list.prepend(&node1);
    list.prepend(&node2);
    list.prepend(&node3);
    try stdout.print("{}\n", .{list});
    try stdout.print("{b}\n", .{list});
}
output
( 3 2 1 ) 
( 11 10 1 )

String repetition with allocator

const std = @import("std");

// Builds a new string containing `times` copies of `original`.
// The caller owns the returned memory.
fn repeat(
    allocator: std.mem.Allocator,
    original: []const u8,
    times: usize,
) std.mem.Allocator.Error![]const u8 {
    // The slice itself is never reassigned, so it is declared const.
    const buffer = try allocator.alloc(
        u8,
        original.len * times,
    );

    for (0..times) |i| {
        std.mem.copyForwards(
            u8,
            buffer[(original.len * i)..],
            original,
        );
    }

    return buffer;
}

pub fn main() !void {
    const stdout = std.io.getStdOut().writer();

    var arena = std.heap.ArenaAllocator.init(
        std.heap.page_allocator,
    );
    // Frees everything allocated from the arena, including `repeated`.
    defer arena.deinit();

    // std.mem.Allocator is a small interface value passed by value.
    const allocator = arena.allocator();

    const original = "Hello ";
    const repeated = try repeat(
        allocator,
        original,
        3,
    );

    // Prints "Hello Hello Hello "
    try stdout.print("{s}\n", .{repeated});
}

output

Hello Hello Hello

Projects


See also


References


Citations

  1. ^ Kelley, Andrew. "Introduction to the Zig Programming Language". andrewkelley.me. Retrieved 8 November 2020.
  2. ^ "Release 0.13.0".
  3. ^ "Taking the warts off C, with Andrew Kelley, creator of the Zig programming language". Sourcegraph. 2021-10-19. Retrieved 2024-04-18.
  4. ^ "Zig has all the elegant simplicity of C, minus all the ways to shoot yourself in the foot". JAXenter. 2017-10-31. Archived from the original on 2017-11-01. Retrieved 2020-02-11.
  5. ^ Elizabeth 2017.
  6. ^ a b c d Yegulalp 2016.
  7. ^ a b "ARC vs. GC". Elements.
  8. ^ "Guide To Java 8 Optional". 28 November 2022.
  9. ^ "Rust: Memory Management".
  10. ^ a b c d e "Allocators". 11 September 2023.
  11. ^ Tyson, Matthew (9 March 2023). "Meet Zig: The modern alternative to C". InfoWorld.com.
  12. ^ a b The Road to Zig 1.0 - Andrew Kelley. ChariotSolutions. 2019-05-09 – via YouTube.
  13. ^ a b "Documentation". Ziglang.org. Retrieved 2020-04-24.
  14. ^ andrewrk (2024-03-13). "origin of the zig programming language name. by @andrewrk". Retrieved 2024-03-13.
  15. ^ "The Zig Programming Language". Ziglang.org. Retrieved 2020-02-11.
  16. ^ "The Zig Programming Language". Ziglang.org. Retrieved 2020-02-11.
  17. ^ "Sudo Null - IT News for you". SudoNull. Retrieved 2020-02-11.
  18. ^ Anderson, Tim (24 April 2020). "Keen to go _ExtInt? LLVM Clang compiler adds support for custom width integers". www.theregister.co.uk. Retrieved 2020-04-24.
  19. ^ a b c d e Chigozie, Oduah (2024-06-04). "Comparing Rust vs. Zig: Performance, Safety, and More". LogRocket Blog. Retrieved 2024-07-16.
  20. ^ New LLVM version 15, Zig legacy uses version 13
  21. ^ "A Reply to _The Road to Zig 1.0_". www.gingerbill.org. 2019-05-13. Retrieved 2020-02-11.
  22. ^ "ziglang/zig". GitHub. Zig Programming Language. 2020-02-11. Retrieved 2020-02-11.
  23. ^ "The Zig Programming Language". Ziglang.org. Retrieved 2020-02-11.
  24. ^ "ziglang/zig". GitHub. Retrieved 2020-02-11.
  25. ^ "0.6.0 Release Notes". Ziglang.org. Retrieved 2020-04-19.
  26. ^ "'zig cc': a Powerful Drop-In Replacement for GCC/Clang - Andrew Kelley". andrewkelley.me. Retrieved 2021-05-28.
  27. ^ "Zig Makes Go Cross Compilation Just Work". DEV Community. 24 January 2021. Retrieved 2021-05-28.
  28. ^ "Jakub Konka on Twitter". Twitter. Archived from the original on 2022-04-10. Retrieved 2021-05-28.
  29. ^ "Announcing the Zig Software Foundation". Ziglang.org. Retrieved 2021-05-28.
  30. ^ "Sponsor ZSF". Ziglang.org. Retrieved 2021-05-28.

Bibliography
