forked from rrcarlosr/Jetpack
277 lines
12 KiB
Plaintext
277 lines
12 KiB
Plaintext
|
Booting Linux on x86 with FIT
|
||
|
=============================
|
||
|
|
||
|
Background
|
||
|
----------
|
||
|
|
||
|
(corrections to the text below are welcome)
|
||
|
|
||
|
Generally Linux x86 uses its own very complex booting method. There is a setup
|
||
|
binary which contains all sorts of parameters and a compressed self-extracting
|
||
|
binary for the kernel itself, often with a small built-in serial driver to
|
||
|
display decompression progress.
|
||
|
|
||
|
The x86 CPU has various processor modes. I am no expert on these, but my
|
||
|
understanding is that an x86 CPU (even a really new one) starts up in a 16-bit
|
||
|
'real' mode where only 1MB of memory is visible, moves to 32-bit 'protected'
|
||
|
mode where 4GB is visible (or more with special memory access techniques) and
|
||
|
then to 64-bit 'long' mode if 64-bit execution is required.
|
||
|
|
||
|
Partly the self-extracting nature of Linux was introduced to cope with boot
|
||
|
loaders that were barely capable of loading anything. Even changing to 32-bit
|
||
|
mode was something of a challenge, so putting this logic in the kernel seemed
|
||
|
to make sense.
|
||
|
|
||
|
Bit by bit more and more logic has been added to this post-boot pre-Linux
|
||
|
wrapper:
|
||
|
|
||
|
- Changing to 32-bit mode
|
||
|
- Decompression
|
||
|
- Serial output (with drivers for various chips)
|
||
|
- Load address randomisation
|
||
|
- Elf loader complete with relocation (for the above)
|
||
|
- Random number generator via 3 methods (again for the above)
|
||
|
- Some sort of EFI mini-loader (1000+ glorious lines of code)
|
||
|
- Locating and tacking on a device tree and ramdisk
|
||
|
|
||
|
To my mind, if you sit back and look at things from first principles, this
|
||
|
doesn't make a huge amount of sense. Any boot loader worth its salts already
|
||
|
has most of the above features and more besides. The boot loader already knows
|
||
|
the layout of memory, has a serial driver, can decompress things, includes an
|
||
|
ELF loader and supports device tree and ramdisks. The decision to duplicate
|
||
|
all these features in a Linux wrapper caters for the lowest common
|
||
|
denominator: a boot loader which consists of a BIOS call to load something off
|
||
|
disk, followed by a jmp instruction.
|
||
|
|
||
|
(Aside: On ARM systems, we worry that the boot loader won't know where to load
|
||
|
the kernel. It might be easier to just provide that information in the image,
|
||
|
or in the boot loader rather than adding a self-relocator to put it in the
|
||
|
right place. Or just use ELF?
|
||
|
|
||
|
As a result, the x86 kernel boot process is needlessly complex. The file
|
||
|
format is also complex, and obfuscates the contents to a degree that it is
|
||
|
quite a challenge to extract anything from it. This bzImage format has become
|
||
|
so prevalent that is actually isn't possible to produce the 'raw' kernel build
|
||
|
outputs with the standard Makefile (as it is on ARM for example, at least at
|
||
|
the time of writing).
|
||
|
|
||
|
This document describes an alternative boot process which uses simple raw
|
||
|
images which are loaded into the right place by the boot loader and then
|
||
|
executed.
|
||
|
|
||
|
|
||
|
Build the kernel
|
||
|
----------------
|
||
|
|
||
|
Note: these instructions assume a 32-bit kernel. U-Boot does not currently
|
||
|
support booting a 64-bit kernel as it has no way of going into 64-bit mode on
|
||
|
x86.
|
||
|
|
||
|
You can build the kernel as normal with 'make'. This will create a file called
|
||
|
'vmlinux'. This is a standard ELF file and you can look at it if you like:
|
||
|
|
||
|
$ objdump -h vmlinux
|
||
|
|
||
|
vmlinux: file format elf32-i386
|
||
|
|
||
|
Sections:
|
||
|
Idx Name Size VMA LMA File off Algn
|
||
|
0 .text 00416850 81000000 01000000 00001000 2**5
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, CODE
|
||
|
1 .notes 00000024 81416850 01416850 00417850 2**2
|
||
|
CONTENTS, ALLOC, LOAD, READONLY, CODE
|
||
|
2 __ex_table 00000c50 81416880 01416880 00417880 2**3
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
3 .rodata 00154b9e 81418000 01418000 00419000 2**5
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
4 __bug_table 0000597c 8156cba0 0156cba0 0056dba0 2**0
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
5 .pci_fixup 00001b80 8157251c 0157251c 0057351c 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
6 .tracedata 00000024 8157409c 0157409c 0057509c 2**0
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
7 __ksymtab 00007ec0 815740c0 015740c0 005750c0 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
8 __ksymtab_gpl 00004a28 8157bf80 0157bf80 0057cf80 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
9 __ksymtab_strings 0001d6fc 815809a8 015809a8 005819a8 2**0
|
||
|
CONTENTS, ALLOC, LOAD, READONLY, DATA
|
||
|
10 __init_rodata 00001c3c 8159e0a4 0159e0a4 0059f0a4 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
11 __param 00000ff0 8159fce0 0159fce0 005a0ce0 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
12 __modver 00000330 815a0cd0 015a0cd0 005a1cd0 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
13 .data 00063000 815a1000 015a1000 005a2000 2**12
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, DATA
|
||
|
14 .init.text 0002f104 81604000 01604000 00605000 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, CODE
|
||
|
15 .init.data 00040cdc 81634000 01634000 00635000 2**12
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, DATA
|
||
|
16 .x86_cpu_dev.init 0000001c 81674cdc 01674cdc 00675cdc 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
17 .altinstructions 0000267c 81674cf8 01674cf8 00675cf8 2**0
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
18 .altinstr_replacement 00000942 81677374 01677374 00678374 2**0
|
||
|
CONTENTS, ALLOC, LOAD, READONLY, CODE
|
||
|
19 .iommu_table 00000014 81677cb8 01677cb8 00678cb8 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
20 .apicdrivers 00000004 81677cd0 01677cd0 00678cd0 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, DATA
|
||
|
21 .exit.text 00001a80 81677cd8 01677cd8 00678cd8 2**0
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, CODE
|
||
|
22 .data..percpu 00007880 8167a000 0167a000 0067b000 2**12
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, DATA
|
||
|
23 .smp_locks 00003000 81682000 01682000 00683000 2**2
|
||
|
CONTENTS, ALLOC, LOAD, RELOC, READONLY, DATA
|
||
|
24 .bss 000a1000 81685000 01685000 00686000 2**12
|
||
|
ALLOC
|
||
|
25 .brk 00424000 81726000 01726000 00686000 2**0
|
||
|
ALLOC
|
||
|
26 .comment 00000049 00000000 00000000 00686000 2**0
|
||
|
CONTENTS, READONLY
|
||
|
27 .GCC.command.line 0003e055 00000000 00000000 00686049 2**0
|
||
|
CONTENTS, READONLY
|
||
|
28 .debug_aranges 0000f4c8 00000000 00000000 006c40a0 2**3
|
||
|
CONTENTS, RELOC, READONLY, DEBUGGING
|
||
|
29 .debug_info 0440b0df 00000000 00000000 006d3568 2**0
|
||
|
CONTENTS, RELOC, READONLY, DEBUGGING
|
||
|
30 .debug_abbrev 0022a83b 00000000 00000000 04ade647 2**0
|
||
|
CONTENTS, READONLY, DEBUGGING
|
||
|
31 .debug_line 004ead0d 00000000 00000000 04d08e82 2**0
|
||
|
CONTENTS, RELOC, READONLY, DEBUGGING
|
||
|
32 .debug_frame 0010a960 00000000 00000000 051f3b90 2**2
|
||
|
CONTENTS, RELOC, READONLY, DEBUGGING
|
||
|
33 .debug_str 001b442d 00000000 00000000 052fe4f0 2**0
|
||
|
CONTENTS, READONLY, DEBUGGING
|
||
|
34 .debug_loc 007c7fa9 00000000 00000000 054b291d 2**0
|
||
|
CONTENTS, RELOC, READONLY, DEBUGGING
|
||
|
35 .debug_ranges 00098828 00000000 00000000 05c7a8c8 2**3
|
||
|
CONTENTS, RELOC, READONLY, DEBUGGING
|
||
|
|
||
|
There is also the setup binary mentioned earlier. This is at
|
||
|
arch/x86/boot/setup.bin and is about 12KB in size. It includes the command
|
||
|
line and various settings need by the kernel. Arguably the boot loader should
|
||
|
provide all of this also, but setting it up is some complex that the kernel
|
||
|
helps by providing a head start.
|
||
|
|
||
|
As you can see the code loads to address 0x01000000 and everything else
|
||
|
follows after that. We could load this image using the 'bootelf' command but
|
||
|
we would still need to provide the setup binary. This is not supported by
|
||
|
U-Boot although I suppose you could mostly script it. This would permit the
|
||
|
use of a relocatable kernel.
|
||
|
|
||
|
All we need to boot is the vmlinux file and the setup.bin file.
|
||
|
|
||
|
|
||
|
Create a FIT
|
||
|
------------
|
||
|
|
||
|
To create a FIT you will need a source file describing what should go in the
|
||
|
FIT. See kernel.its for an example for x86. Put this into a file called
|
||
|
image.its.
|
||
|
|
||
|
Note that setup is loaded to the special address of 0x90000 (a special address
|
||
|
you just have to know) and the kernel is loaded to 0x01000000 (the address you
|
||
|
saw above). This means that you will need to load your FIT to a different
|
||
|
address so that U-Boot doesn't overwrite it when decompressing. Something like
|
||
|
0x02000000 will do so you can set CONFIG_SYS_LOAD_ADDR to that.
|
||
|
|
||
|
In that example the kernel is compressed with lzo. Also we need to provide a
|
||
|
flat binary, not an ELF. So the steps needed to set things are are:
|
||
|
|
||
|
# Create a flat binary
|
||
|
objcopy -O binary vmlinux vmlinux.bin
|
||
|
|
||
|
# Compress it into LZO format
|
||
|
lzop vmlinux.bin
|
||
|
|
||
|
# Build a FIT image
|
||
|
mkimage -f image.its image.fit
|
||
|
|
||
|
(be careful to run the mkimage from your U-Boot tools directory since it
|
||
|
will have x86_setup support.)
|
||
|
|
||
|
You can take a look at the resulting fit file if you like:
|
||
|
|
||
|
$ dumpimage -l image.fit
|
||
|
FIT description: Simple image with single Linux kernel on x86
|
||
|
Created: Tue Oct 7 10:57:24 2014
|
||
|
Image 0 (kernel@1)
|
||
|
Description: Vanilla Linux kernel
|
||
|
Created: Tue Oct 7 10:57:24 2014
|
||
|
Type: Kernel Image
|
||
|
Compression: lzo compressed
|
||
|
Data Size: 4591767 Bytes = 4484.15 kB = 4.38 MB
|
||
|
Architecture: Intel x86
|
||
|
OS: Linux
|
||
|
Load Address: 0x01000000
|
||
|
Entry Point: 0x00000000
|
||
|
Hash algo: sha1
|
||
|
Hash value: 446b5163ebfe0fb6ee20cbb7a8501b263cd92392
|
||
|
Image 1 (setup@1)
|
||
|
Description: Linux setup.bin
|
||
|
Created: Tue Oct 7 10:57:24 2014
|
||
|
Type: x86 setup.bin
|
||
|
Compression: uncompressed
|
||
|
Data Size: 12912 Bytes = 12.61 kB = 0.01 MB
|
||
|
Hash algo: sha1
|
||
|
Hash value: a1f2099cf47ff9816236cd534c77af86e713faad
|
||
|
Default Configuration: 'config@1'
|
||
|
Configuration 0 (config@1)
|
||
|
Description: Boot Linux kernel
|
||
|
Kernel: kernel@1
|
||
|
|
||
|
|
||
|
Booting the FIT
|
||
|
---------------
|
||
|
|
||
|
To make it boot you need to load it and then use 'bootm' to boot it. A
|
||
|
suitable script to do this from a network server is:
|
||
|
|
||
|
bootp
|
||
|
tftp image.fit
|
||
|
bootm
|
||
|
|
||
|
This will load the image from the network and boot it. The command line (from
|
||
|
the 'bootargs' environment variable) will be passed to the kernel.
|
||
|
|
||
|
If you want a ramdisk you can add it as normal with FIT. If you want a device
|
||
|
tree then x86 doesn't normally use those - it has ACPI instead.
|
||
|
|
||
|
|
||
|
Why Bother?
|
||
|
-----------
|
||
|
|
||
|
1. It demystifies the process of booting an x86 kernel
|
||
|
2. It allows use of the standard U-Boot boot file format
|
||
|
3. It allows U-Boot to perform decompression - problems will provide an error
|
||
|
message and you are still in the boot loader. It is possible to investigate.
|
||
|
4. It avoids all the pre-loader code in the kernel which is quite complex to
|
||
|
follow
|
||
|
5. You can use verified/secure boot and other features which haven't yet been
|
||
|
added to the pre-Linux
|
||
|
6. It makes x86 more like other architectures in the way it boots a kernel.
|
||
|
You can potentially use the same file format for the kernel, and the same
|
||
|
procedure for building and packaging it.
|
||
|
|
||
|
|
||
|
References
|
||
|
----------
|
||
|
|
||
|
In the Linux kernel, Documentation/x86/boot.txt defines the boot protocol for
|
||
|
the kernel including the setup.bin format. This is handled in U-Boot in
|
||
|
arch/x86/lib/zimage.c and arch/x86/lib/bootm.c.
|
||
|
|
||
|
The procedure for entering 64-bit mode on x86 seems to be described here:
|
||
|
|
||
|
http://wiki.osdev.org/64-bit_Higher_Half_Kernel_with_GRUB_2
|
||
|
|
||
|
Various files in the same directory as this file describe the FIT format.
|
||
|
|
||
|
|
||
|
--
|
||
|
Simon Glass
|
||
|
sjg@chromium.org
|
||
|
7-Oct-2014
|