Unikraft is a fast, secure and open-source Unikernel Development Kit

ryukoposting · on Feb 7, 2022

I'm an embedded guy by trade, so the idea of a Unikernel is nothing new to me. But wait... use cases overlapping with general-purpose OSes? nginx benchmarks??? This is exciting.

I know DevOps for bare-metal firmware is a PITA partly because of the tightly-coupled application, kernel, and libraries. I'm hoping someone familiar with Unikraft/OSv/etc could sate my curiosity...

- Do you test your app inside a container before building your Unikraft/OSv image? Or is there a way to create a CI/CD pipeline that builds your unikernel executable + tests the whole thing as a compiled unit?

- How often do bugs appear in Unikraft that don't appear when running the app on a traditional OS? To what extent does the complexity of the app's dependencies affect this?

- In terms of convenience, how does Unikraft/OSv compare to using a highly-customizable general-purpose* OS like Gentoo?

(*edit for clarity: "general-purpose OS" in the sense that it 1) can load arbitrary data using one or more filesystems 2) can execute loaded data as a program 3) has means by which a human or human-controlled machine may cause the OS to load and execute said programs. This definition does not exclude highly-specialized Gentoo/Nix/whatever setups that are tailored to run a particular program)

nderjung · on Feb 7, 2022

1. We build unikernels using the 'kraft container', which is a Docker/OCI image[0][1] that has the necessary build tools to build Unikraft unikernels. We plug this into Concourse CI, which builds thousands of combinations of unikernels[2] as part of our code review process[3]. In addition to this, we have ongoing research and tooling to help automatically discover permutations of unikernel builds[4]. We then run the unikernel natively using the target platform/hardware combination or use QEMU emulation.

2. Really great question, but mostly you can expect the same functionality of an application when it runs as a unikernel because the application "thinks" it's still running in a traditional OS environment -- as it should be. Check out this documentation[5] (after step 7) about porting, it has snippets about where the boundary sometimes breaks. Even then, you can run it in POSIX-compatibility mode[6].

3. Well, general-purpose OSes are not suited for deployment environments (which is what Unikraft is suited for). Installing Gentoo (or Ubuntu, Debian, for that matter) is a waste of resources if you only SSH in once to install your desired application.

[0]: https://unikraft.org/docs/usage/install/#docker

[1]: https://github.com/unikraft/kraft/tree/staging/package/docke...

[2]: https://builds.unikraft.io

[3]: https://unikraft.org/docs/contributing/review-process/#stage...

[4]: https://github.com/lancs-net/wayfinder

[5]: https://unikraft.org/docs/develop/porting/#providing-build-f...

[6]: https://unikraft.org/docs/features/posix-compatibility/
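
For anyone trying to picture how a CI job ends up with "thousands of combinations" as in point 1 above, here is a minimal sketch of a build matrix. It is only an illustration: the axis values and the `build_matrix` helper are hypothetical and are not taken from Unikraft's Concourse configuration or from the kraft tool.

```python
from itertools import product

# Hypothetical build axes, for illustration only. These lists are assumptions
# and do not come from Unikraft's actual CI setup.
PLATFORMS = ["kvm", "xen", "linuxu"]
ARCHITECTURES = ["x86_64", "arm64"]
APPLICATIONS = ["helloworld", "nginx", "redis"]


def build_matrix(platforms, architectures, applications):
    """Return every (platform, architecture, application) combination."""
    return [
        {"platform": plat, "arch": arch, "app": app}
        for plat, arch, app in product(platforms, architectures, applications)
    ]


matrix = build_matrix(PLATFORMS, ARCHITECTURES, APPLICATIONS)
print(len(matrix))  # 3 * 2 * 3 = 18 combinations
```

Every additional axis (library versions, configuration options, and so on) multiplies the total, which is how the count grows into the thousands.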

ryukoposting · on Feb 7, 2022

I edited my comment to clarify this, but I meant "general-purpose OS" in a very broad sense - i.e. not much more than "can it load, execute, and time-share between arbitrary programs loaded from an attached storage device."

Thanks for the links!

nderjung · on Feb 7, 2022

The application code is compiled along with kernel code, so you can think of unikernels as a single-process VM: there's nothing else running other than the application, so it boots straight to `main()`. Unikraft just facilitates the runtime of the application so that it can run as a VM or on bare metal. There's no shell, so you can't instantiate another program from disk. If the application wishes to read and write from attached storage, it can, but it can't start another process if it's reading application code to be executed. Starting another process is a bit trickier since there is no `fork()` to execute another process. Interesting work is being done, however, to enable multi-threading across cores via SMP[0] and to provide a fork-like ability with regard to the application's logic[1], though not in a wider general multi-processing environment. I hope this clarifies things.

[0]: https://github.com/unikraft/unikraft/pull/244

[1]: https://xen2021.sched.com/event/jAME/cloning-unikernels-on-x...

ryukoposting · on Feb 7, 2022

I'm familiar with unikernels as a concept - heck, I wrote (a bad) one for a research project during my undergrad with embedded systems and softcore processors as the target. It's exciting to see unikernels being used with software of a similar scale, but with a much more broad solution space.

mscdex · on Feb 7, 2022

I was a bit surprised to find out that Unikraft does not yet support multiple cores/CPUs (at least on kvm and x86): https://github.com/unikraft/unikraft/pull/244

nderjung · on Feb 7, 2022

We're working hard on adding SMP support to Unikraft which is planned for the next release. The PR you have linked has all the details about the on-going work!

dang · on Feb 7, 2022

Past related threads:

Unikraft – Fast, Specialized Unikernels - https://news.ycombinator.com/item?id=26954547 - April 2021 (72 comments)

Unikraft: Posix-Like Unikernel - https://news.ycombinator.com/item?id=26142285 - Feb 2021 (10 comments)

Cut Your Cloud Computing Costs by Half with Unikraft - https://news.ycombinator.com/item?id=25431474 - Dec 2020 (5 comments)

Unikraft Unikernel Project - https://news.ycombinator.com/item?id=17439594 - July 2018 (14 comments)

moritonal · on Feb 7, 2022

"Begun the unikernel wars, have"

From my fairly naive POV it seems like unikernels are the next logical step in computing. Docker was the last jump, and unikernels are set to be the next, with some form of WASM as a host.

mikepurvis · on Feb 7, 2022

They feel a bit more orthogonal to me, given that unikernels are about sharing a VM hypervisor, whereas containers are about safely sharing a Linux kernel.

They're solving similar problems in terms of reducing image size, startup time, security surface area, and so on. But the mechanism is quite different, so it feels like basically none of the tooling will translate. Like, where is the Kubernetes of unikernel deployments? It would have to be built from scratch, and would probably end up looking more like Terraform than like the Kubernetes of today.

nderjung · on Feb 7, 2022

It's possible to run Unikraft unikernels via kubernetes with minimal interference to the ecosystem, check out the talk at CNCF: https://www.youtube.com/watch?v=cV-xawN9_cg

We'll be launching managed kubernetes support for Unikraft unikernels soon too at https://unikraft.io

nwmcsween · on Feb 8, 2022

Will the container runtime be OSS? It would be nice to use this outside of a managed offering

mikepurvis · on Feb 7, 2022

Cool! For others interested, the k8s integration discussion starts around 17m00.

invokestatic · on Feb 7, 2022

I went with OSv (another unikernel) for a previous pet project and, while I really loved the concept, I found the tooling to be immature. This project’s tooling and documentation do look better so I look forward to trying it out.

One thing I find missing with these unikernels though is IPSec support and Firewalls. I’d love to throw a unikernel image on DigitalOcean and have a secure software-defined IPSec tunnel.

nderjung · on Feb 7, 2022

It's possible to create an IPSec + firewall based on the Click Modular Router[0] and run this on top of Unikraft[1].

[0]: https://github.com/kohler/click/wiki/IPsecEncap (and other IPSec* elements)

[1]: https://github.com/unikraft/app-click

It could make for an interesting tutorial with a full Click-based IPSec router though! :)

convolvatron · on Feb 7, 2022

out of real curiosity - what would be the point of a firewall in a unikernel image? I mean presumably its to stop people from opening random ports. but if you bundle the application and you don't support a shell or forking processes in general then the only bound ports are those which the application explicitly opens.

so what value in requiring someone to run around the interface and open it in the firewall as well?

coredog64 · on Feb 7, 2022

You might want to let your metrics system scrape a private endpoint published on a different port. Or you might have management task that you want to restrict to your internal network. Possible if you delegate to network hardware, but sometimes those asks are a PITA.

speed_spread · on Feb 7, 2022

OSv has been around for a while, I remember looking into it five years ago. What did you feel was missing in terms of tooling?

mwcampbell · on Feb 7, 2022

I've recently become thoroughly convinced of the merits of consolidating onto as few machines (physical or virtual) as possible. One reason is that I recently consolidated some of my company's infrastructure onto a single bare-metal server to reduce costs. And then in the middle of that, this post came out:

https://rachelbythebay.com/w/2022/01/27/scale/

It seems to me that running lots of small VMs with unikernels is inherently wasteful compared to running many processes on a single machine with a shared kernel that can make optimal use of the machine's resources. Sure, the unikernel-based VMs can be smaller than equivalent Linux VMs, but one still has to allocate a fixed amount of RAM, storage, and (for public cloud platforms) CPU to each VM. We inevitably add some padding to those allocations to ensure that we have headroom, and the total probably adds up to more than we would need to allocate to a single machine (physical or virtual) running all of those processes on a single kernel. And on public cloud platforms, we have to pay for those padded resource allocations.
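
The padding argument above can be made concrete with a small sketch. All numbers below are made up for illustration, and the 25% headroom is an assumption; whether a real deployment sees a similar gap depends on how strongly the services' peaks coincide.

```python
# Assumed 25% padding on top of observed peak usage (hypothetical figure).
HEADROOM = 0.25

# Hypothetical memory usage samples (MiB) for three services, taken at the
# same three points in time. Made-up data for illustration.
USAGE_MIB = {
    "api": [200, 300, 250],
    "worker": [450, 150, 200],
    "cache": [100, 120, 200],
}


def separate_vm_allocation(usage, headroom):
    """One VM per service: each VM is sized for its own peak plus headroom."""
    return sum(max(series) * (1 + headroom) for series in usage.values())


def shared_host_allocation(usage, headroom):
    """One shared machine: sized for the peak of the combined usage plus headroom."""
    combined = [sum(sample) for sample in zip(*usage.values())]
    return max(combined) * (1 + headroom)


print(separate_vm_allocation(USAGE_MIB, HEADROOM))  # 1187.5
print(shared_host_allocation(USAGE_MIB, HEADROOM))  # 937.5
```

In this toy data the services peak at different times, so the shared machine needs less padded memory in total. If every service peaked at the same moment, the two totals would be identical.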

I've certainly done deployments with lots of small Linux VMs in the past; in my recent migration process, I was replacing such a setup with one big box. Creating lots of small VMs is certainly a convenient and robust way to independently deploy and update several components. But it's obviously not the only way.

The home page of the forthcoming Unikraft Cloud service says, "The cloud is essential to your business but you know you are overpaying." But I think a better answer is to consolidate onto a few big VMs, using container orchestration to keep deployment manageable.

nderjung · on Feb 7, 2022

It really depends on your use case here. If the many application processes rely on the same OS libraries, versioned language runtimes (e.g. same python version), kernel version, etc. AND you trust the OS then it may make sense. However, unikernels offer the lightweightness of a container process with the security of a VM. In addition to this, memory ballooning[0][1] and other resource-elastic features are available to VMs too, allowing you to under-provision them and then later increase resources when load demands it.

Unikraft unikernels can also be managed using the same orchestration tools as containers, check out the talk at CNCF[2].

[0]: https://github.com/unikraft/unikraft/pull/219

[1]: https://pmhahn.github.io/virtio-balloon/

[2]: https://www.youtube.com/watch?v=cV-xawN9_cg

mwcampbell · on Feb 8, 2022

> memory ballooning[0][1] and other resource-elastic features are available to VMs too

If I'm using a public cloud platform's hypervisor, those features may benefit the cloud provider, but not me. Or are you targeting users running a hypervisor on bare metal?

nderjung · on Feb 8, 2022

I think it can benefit both parties -- underprovisioning for a smaller bill and then increasing when demand requires it, so as to prevent degradation in QoS. That said, if load increases sharply it can induce high costs. Some cloud providers, like AWS, do not offer memory (or any other resource) ballooning, so there you will experience the over-provisioning problem you have described. However, we aim to alleviate some of these problems with the uniqueness of unikernels in our soon-to-be-released Cloud Platform at https://unikraft.io, with features like memory ballooning (with hard upper limits) and deep in-kernel monitoring to understand application performance.
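
As a rough illustration of "under-provision, then grow on demand, with a hard upper limit", the policy could be sketched as a simple clamp. This is a hypothetical sketch only: `balloon_target_mib` and its parameters are invented for illustration and are not an API of Unikraft, its cloud platform, or virtio-balloon.

```python
def balloon_target_mib(demand_mib, floor_mib, hard_limit_mib, headroom=0.25):
    """Pick a memory target for a VM (hypothetical policy, not a real API).

    The target follows current demand plus some headroom, but never drops
    below the under-provisioned floor and never exceeds the hard upper
    limit, which is what caps the bill when load spikes.
    """
    if floor_mib > hard_limit_mib:
        raise ValueError("floor_mib must not exceed hard_limit_mib")
    wanted = demand_mib * (1 + headroom)
    return max(floor_mib, min(wanted, hard_limit_mib))


print(balloon_target_mib(10, floor_mib=16, hard_limit_mib=256))    # 16
print(balloon_target_mib(100, floor_mib=16, hard_limit_mib=256))   # 125.0
print(balloon_target_mib(1000, floor_mib=16, hard_limit_mib=256))  # 256
```

The hard limit is the part that addresses the cost concern: once demand exceeds it, the VM stops growing and QoS degrades instead of the bill increasing.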

felipehuici · on Feb 9, 2022

Definitely agree, in fact we tend to use the term "massive consolidation", where we run thousands of VMs on a single server, thus saving costs. Unikraft unikernels are a perfect fit for this since they consume little memory and boot relatively fast. In early work [0] we were booting as many as 8000 hello world VMs/unikernels on the same server on the Xen hypervisor. More recently we have been booting 1K NGINX Unikraft images on a single server.

[0] https://dl.acm.org/doi/10.1145/3132747.3132763

fire · on Feb 8, 2022

Sort of sounds like this is a good tool to pair with things like firecracker? The goal with most of these ( aside security ) is a better ability to (bin?) pack these small "blocks" into a larger "box".

Tradeoffs between actual usage costs and devops time cost, imo

felipehuici · on Feb 8, 2022

Yes, in fact we have some early support for Firecracker, where we can at least boot some basic Unikraft images with it (e.g., see page 10 of this paper[0], Figure 10, where we get the shortest boot times with Firecracker). We're still missing networking support on FC, which we're working on.

[0] https://dl.acm.org/doi/pdf/10.1145/3447786.3456248

anikuni · on Feb 7, 2022

This is an exciting project, congratulations. I'm looking forward to the docs on embedded usage, and also which languages are supported and how to configure them. For now there seem to be quite a few unikraft/app* repos with such examples.

felipehuici · on Feb 8, 2022

Agree, we're working on an embedded page, and we intend to release the code during our May release. Also agree with having a page about the different languages we currently support (C/C++, Lua, Go, Python, Ruby) and others we're working on (e.g., Rust, Java). Always interested to hear which languages and/or frameworks people are interested in.

johngalt · on Feb 7, 2022

Are unikernels a performance/efficiency tool? Squeezing more nodes into a single host with minimal overhead.

Or are they a tool to achieve simplicity/elegance? Fewer moving parts to troubleshoot at the OS layer, and smaller but more formal composition.

tenebrisalietum · on Feb 7, 2022

Yep. I think this is how it works.

Kernel becomes a library the application uses instead of something that jumps over the CPU context. Application runs as root with the rump kernel or unikernel linked in. Only portions of kernel actually used need to be present. System calls become function calls. Multitasking support provided by a threading library. You shouldn't run multiple applications in a unikernel.

If you have a number of servers running a hypervisor as a base OS, and your applications on its VMs are network-centric like web servers, load balancers, database services, or microservices, and you don't really use the user-level security of a traditional OS, this can enhance performance by eliminating the user-kernel CPU context switch and consume less RAM.

birdyrooster · on Feb 7, 2022

Everyone moved to containerize their code and then security organizations in corporations have been putting the brakes on that and pushing for virtualization layer for additional isolation in multi tenant environments. Since every container ends up being a virtual machine anyways, the only way to slim down is unikernel.

pjmlp · on Feb 8, 2022

The irony of having Kata Containers or Hyper-V Isolation, because containers alone are not enough.

andai · on Feb 8, 2022

>Unikraft has been extensively evaluated in terms of performance. Evaluations of using off-the-shelf applications on Unikraft results in a 1.7x-2.7x performance improvement compared to Linux guests. In addition, Unikraft images for these apps are around 1MB, require less than 10MB of RAM to run, and boot in around 1ms on top of the VMM time (total boot time 2ms-40ms).

https://unikraft.org/docs/features/performance/

staticassertion · on Feb 7, 2022

Says it's secure, Github shows 76% of the code is in C. I see the word "secure" in a few places but it's just stated without any indication as to what about this makes it secure.

nderjung · on Feb 7, 2022

Unikraft is based on a small trusted compute base, meaning there is nothing else running with a unikernel, no ssh, no daemons, no Linux, etc.

Towards increasing security, however, we have just introduced native support for Rust[0] in Unikraft, paving the way for more internal libraries to be based on this secure and performant language.

[0]: https://github.com/unikraft/unikraft/pull/348

staticassertion · on Feb 7, 2022

Thanks, I think having a "read more" would be helpful. You do a good job of quickly demonstrating performance with some numbers, but there's nothing about security on there. I think it'd go a long way for people like me who are going to be immediately skeptical of software in C claiming to be safe.

nderjung · on Feb 7, 2022

Thanks for the feedback, we're in the process of adding a security section[0] which will detail more on the on-goings, but we'll work on adding more highlights on the main page.

I need to highlight we have separate research[1][2] which will make its way upstream soon which aims to provide hardening between internal libraries (e.g. isolating the network stack or scheduler) using gates like Intel MPK or separate hardware-accelerated services.

[0]: https://github.com/unikraft/docs/pull/32

[1]: https://project-flexos.github.io/

[2]: https://github.com/project-flexos/unikraft

staticassertion · on Feb 7, 2022

Pretty cool, will definitely read through that.

fulafel · on Feb 7, 2022

How does the system tolerate vulnerabilities outside the TCB? I thought unikernels often didn't have protections that would shield a TCB from app vulnerabilities.

felipehuici · on Feb 8, 2022

Hi, no, the statement wasn't to isolate the kernel code from the application, since it's all in the same address space. Instead, it's to reduce the possibility of bugs (but again, not in the application), and reduce the vectors for attack in the underlying stack. For separating the application from the kernel (and from components within the kernel, since Unikraft is modular) we are doing further work called FlexOS, based on Unikraft, and to appear soon at the ASPLOS conference[0]; a short version of the paper appeared at HotOS [1].

[0] https://asplos-conference.org/program/

[1] https://sigops.org/s/conferences/hotos/2021/papers/hotos21-s...

fulafel · on Feb 9, 2022

Interesting!

I also found this paper that talks about establishing a TCB in the unikernel, which was a good companion read. https://www.ssrg.ece.vt.edu/papers/spma20.pdf

convolvatron · on Feb 7, 2022

what are you trying to protect the kernel for if it only hosts the single application? are you assuming that local root has some distinguished privilege outside this box?

fulafel · on Feb 8, 2022

Good question, I assume there was some reason to talk about a TCB and the answer might have shed light on that as well.

Terry_Roll · on Feb 8, 2022

This is an excellent security tool by removing the attack vectors of the OS.

Who needs a lite/minimal/headless version of an OS, when you can use this instead?

Suddenly I don't need those Xeon processors, a few Raspberry Pi Zeros will do and the environmentalists should be happy.

Shame the SBC link is light on information! https://unikraft.org/docs/features/embedded/

felipehuici · on Feb 8, 2022

Hi, support for the RPI and perhaps another device should be out by release 0.9 in May, along with the documentation at the link you posted (the code's working, but it needs clean-up).

Terry_Roll · on Feb 8, 2022

It's in my diary. What sort of clean up?

felipehuici · on Feb 8, 2022

Code clean up, commit history clean up, more testing and perhaps a bit of re-basing (it was built against a somewhat older version of Unikraft).

phendrenad2 · on Feb 7, 2022

Unikernels are interesting, but as long as people treat them like "linux without linux" they won't go far.

The real potential of unikernels comes from making apps that are more self-aware and take up some of the functions previously handled by linux (such as monitoring memory usage).

nderjung · on Feb 7, 2022

It works both ways with Unikraft, either bring an existing application and let it run with the added performance/security (and think it's on Linux) or write your application with our performance-oriented APIs[0][1].

[0]: https://unikraft.org/docs/concepts/architecture/

[1]: https://usoc21.unikraft.org/docs/sessions/10-high-performanc...

Youden · on Feb 7, 2022

How does one store data with Unikraft? This is the problem I hit with other unikernel projects. OSv seemed to support ZFS or NFS somehow but I couldn't quite figure out the documentation. I can't find any references to storage at all for Unikraft.

halation_effect · on Feb 7, 2022

You can use 9pfs or build upon the block device.

nderjung · on Feb 7, 2022

Yes you're right, it's a simple network-like protocol allowing you to mount a path on the host OS to the Unikraft unikernel VM similar to a container volume. In addition, Unikraft's abstract APIs[0] allow for more block devices such as EXT{2..4}, etc. which you mount in a similar way. Alternatively, you can put your filesystem into a CPIO format and mount it as initram and load it into RAM (great for performance and read-only file systems, like webservers).

[0]: https://unikraft.org/docs/concepts/architecture/

Youden · on Feb 8, 2022

Ah, I see. It looks like this is actually relatively simple but hidden away in code you'd never see until you start using Unikraft.

This is something that strikes me as an obvious question about a unikernel so I'd like to see a bigger callout in the docs.

felipehuici · on Feb 8, 2022

We also have support for EXT2/EXT4 but it's not open source yet.

halation_effect · on Feb 7, 2022

Reference paper[1].

[1] https://dl.acm.org/doi/10.1145/3447786.3456248

edsiper2 · on Feb 7, 2022

ah!

[ERROR ] GitHub rate limit exceeded! If you have not done so already,

[ERROR ] you can tell kraft to use a personal access token when contacting

[ERROR ] the GitHub API. First, visit:

nderjung · on Feb 7, 2022

Hi @edsiper2, if you are running into any problems I'm happy to help, we can chat directly on the Unikraft Discord server: https://bit.ly/UnikraftDiscord

jonpalmisc · on Feb 7, 2022

Anyone have experience using projects like this? Are the performance gains (and/or other benefits) that noticeable?

speed_spread · on Feb 7, 2022

Have a look at Seastar http://seastar.io/

Running the server in the same address space as the (uni)kernel can have major impact on performance for I/O bound apps, cutting off system calls and context switching overhead.

matthewfcarlson · on Feb 7, 2022

I’m quite curious how something like this compares to a more performance focused RPi OS like dietPI

terafo · on Feb 7, 2022

dietPI is still Linux, and all performance limitations that come with Linux are still there. They compare it with Alpine, which is much slimmer than dietPI, yet Alpine still loses in application performance by quite a margin.

Koshkin · on Feb 7, 2022

> 166% faster

Ew. I hope they mean "2.66 times as fast."

xuhu · on Feb 7, 2022

It's 2.66 times as fast, there is a bar chart on the homepage.
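
For the record, the conversion between the two phrasings is simple arithmetic: "N% faster" corresponds to a speedup factor of 1 + N/100. A small sketch (the function names are made up for illustration):

```python
def percent_faster_to_speedup(percent_faster):
    """'166% faster' means the baseline plus 166% of it, i.e. 1 + 1.66."""
    return 1 + percent_faster / 100


def speedup_to_percent_faster(speedup):
    """Inverse conversion: a 2x speedup is '100% faster'."""
    return (speedup - 1) * 100


print(f"{percent_faster_to_speedup(166):.2f}x as fast")   # 2.66x as fast
print(f"{speedup_to_percent_faster(2.66):.0f}% faster")   # 166% faster
```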

Symmetry · on Feb 7, 2022

I presume that's for particular benchmarks where syscall overhead is significant. Which is certainly true for some real world applications but not for others.

dantodor · on Feb 8, 2022

How does it compare with nanovms ?

felipehuici · on Feb 8, 2022

For a while nanovms was based on Rump[0], which had terrible performance. The new version of nanovms[1] we haven't benchmarked but we should; having said that, even the founder says they "[...] have spent very little time benchmarking" [2]

[0] https://en.wikipedia.org/wiki/Rump_kernel

[1] https://github.com/nanovms/nanos

[2] https://www.gula.tech/blog/files/135c832f92668268f1e9140a524...