From Fedora Project Wiki

Fedora Classroom - Package Taxonomy and Techniques - Ignacio Vazquez-Abrams - Saturday, November 7, 2008

IRC Log of the Class

 23:00 -!- nirik changed the topic of #fedora-classroom to: Fedora Classroom - Package Taxonomy and Techniques
 with your teacher: ivazquez  - See https://fedoraproject.org/wiki/Communicate/IRC/Classroom for more info
 23:00 < Ineluctable> thanks stickster
 23:00 < stickster> neverho0d: It is, but you could work with another person to edit your summaries
 23:00  * kdn *bell rings*
 23:00 < brunowolff> If it isn't, then translating it could be another area to help out.
 23:01 < neverho0d> stickster: thank you
 23:01 < stickster> brunowolff: +1, exactly
 23:01  * stickster really scoots now
 23:01 < ivazquez> Hello everyone, and thank you for coming.
 23:01 < mattia> hi ivazquez
 23:01 < Abd4llA> Helloo Mr ivazquez :P
 23:01 < neverho0d> brunowolff: I'll be glad to contribute as much as I can
 23:01 < thomasj> helloiva
 23:01 < thomasj> ops
 23:01 < ivazquez> You see them go by all the time in anaconda and PackageKit.
 23:01 < kiakli> hi
 23:02 < cga> ciao ivazquez, i'm here too ;)
 23:02 < thomasj> hello ivazquez
 23:02 < ivazquez> libfoo-3.2-5.fc9... barprogs-5.2.6-0.cvs20081016...
 23:02 < ivazquez> But just what is it that makes a package...
 23:02 < ivazquez> ... a package?
 23:02 < domg472> rpmbuil
 23:02 < domg472> nvm
 23:03 < brunowolff> A spec file.
 23:03 < ivazquez> At its simplest, a package is a set of files, and metadata about each file as well as
 about the entire bundle.
 23:03 < kdn> clean installs, clean uninstalls, clean upgrades?
 23:03 < ivazquez> Things like permissions, ownership, scripts executed during installation, etc.
 23:04 < ivazquez> So let's spend some time looking at these things.
 23:04 < ivazquez> The most important thing you need to know about a package is its NEVRA.
 23:04 < ivazquez> This stands for Name, Epoch, Version, Release, and Architecture.
 23:04 < ivazquez> It's what determines when a package can be installed or upgraded.
 23:05 < ivazquez> The name is the part we're most familiar with.
 23:05 < ivazquez> glibc, firefox, etc.
 23:05 < ivazquez> It is the smallest part of a package we can use and still talk about the package itself.
 23:06 < ivazquez> The epoch, version, and release determine the "order" of packages.
 23:06 < ivazquez> That is to say, which package is considered "newer" than others.
 23:07 < ivazquez> Normally we don't need to worry about the epoch, so we'll hold out on that one for just a
 moment.
 23:07 < ivazquez> Version is mostly self-explanatory.
 23:07 < ivazquez> 2.8, 3.0.2, etc.
 23:07 < ivazquez> When upstream releases software, they give it a version to keep it separate from all the
 other versions they release.
 23:08 < ivazquez> The first part of a package that we really get to handle as a distro is the release.
 23:08 < EvilBob> how should a packager deal with packaging a beta or prerelease version from upstream?
 23:09 < ivazquez> That is actually covered in Fedora's packaging guidelines.
 23:09 < ivazquez> I'll be happy to discuss packaging issues after.
 23:09 <@nirik> http://fedoraproject.org/wiki/Packaging/NamingGuidelines#Pre-Release_packages
 23:09 < ivazquez> The release is used to distinguish one package of a specific version from another.
 23:10 < ivazquez> So you package libfoo 3.2, and the first package gets a release of "1".
 23:10 < ivazquez> Then let's say someone notices a problem in your package, and you fix it.
 23:11 < ivazquez> When you go to fix it, you keep the version the same (since it's still the same code from
 upstream), but you bump the release up to "2".
 23:11 < ivazquez> That way people and tools realize that libfoo-3.2-2 is an update for libfoo-3.2-1.
 23:12 < ivazquez> And when upstream releases a new version, say, libfoo 3.3, you can drop the release back
 down to "1".
 23:12 < ivazquez> libfoo-3.3-1 is seen as an update to libfoo-3.2-2 since the version is higher.
 23:13 < brunowolff> The kernel seems to keep an increase release number even after minor releases?
 23:13 < ivazquez> It used to. They fixed that.
 23:13 < ivazquez> The exact details involve all sorts of ugliness regarding the "base" commit.
 23:14 < ivazquez> We can come back to that later.
 23:14 < brunowolff> When 2.27.5 was released it had -88 for the release number?
 23:14 < ivazquez> Now, epoch.
 23:14 < brunowolff> OK
 23:14 < ivazquez> The epoch is the rarest-seen and the most troublesome part of a package.
 23:15 < ivazquez> By default it's not shown when you use rpm to query a package.
 23:15 < ivazquez> (and we'll come back to querying after)
 23:15 < zcat> if you have rpmdevtools installed you can play with the version/epoch comparisons.
 rpmdev-vercmp libfoo-3.2-2 libfoo-3.2-1
 23:16 < ivazquez> Additionally, when writing a spec file, you must include this invisible epoch in listing
 packages that you are dependent upon.
 23:16 < ivazquez> Many a packager has been bitten by omitting the epoch in this scenario.
 23:16 < ivazquez> Now, why would anyone use the epoch, given all this trouble?
 23:17 < ivazquez> The simple answer is that the version has had issues.
 23:18 < ivazquez> For instance, going from, say, "3.2preview" to "3.2final" will be a problem, since 3.2preview
 actually looks newer than 3.2final.
 23:19 < cga> how comes?
 23:19 < ivazquez> So what must be done is the epoch must be bumped to 1 (it is 0 by default), and everyone
 involved lets out a collective groan because of the issues mentioned earlier.
 23:19 < thomasj> cga, p - f
 23:19 < Sid> because 2.3preview comes later in an alphabetical listing than 2.3final
 23:19 < ivazquez> Strings in a VR are compared lexicographically.
 23:19 < Sid> 3.2 even
 23:19 < cga> i see thanks
 23:19 < EvilBob> cga: alphanumeric ordering
 23:19 < cga> fair enough
 23:20 < ivazquez> This brings us to the fifth part, the architecture.
 23:20 < ivazquez> You would think that this is a very small topic.
 23:20 < ivazquez> After all, i386 goes on i386 machines, x86_64 on x86_64, and so on.
 23:21 < ivazquez> Unfortunately this is not true.
 23:21 < ivazquez> And this is for 2 reasons.
 23:21 < ivazquez> The first is that there are architectures that are "equivalent" to each other.
 23:22 < ivazquez> E.g., i386, i486, i586, i686, etc.
 23:22 < ivazquez> You can only have a single package (with exceptions we'll cover later) with the same NA
 installed at a time.
 23:23 < ivazquez> So you can't have libfoo.i386 installed at the same time as libfoo.i686.
 23:23 < ivazquez> The second is a little feature called "multilib".
 23:24 < ivazquez> This allows packages from different equivalent architectures to be installed at the same time.
 23:24 < ivazquez> It's what lets you install glibc.x86_64 and glibc.i686 on the same system.
 23:25 < ivazquez> However, it also has its issues.
 23:25 < Discordian> Doesn't that increase the memory residency?
 23:26 < ivazquez> One of the issues is when the x86_64 package and the ix86 (i386, i686, etc.) package contain
 the same files.
 23:26 < ivazquez> Sorry, memory residency?
 23:26 < Discordian> Don't shared libs take up memory?
 23:27 < Discordian> Sorry ignore that please don't want to distract you
 23:27 < ivazquez> Only when they're loaded. Packages on the disk take up no memory until they're requested.
 23:27 < ivazquez> Alright, same files.
 23:28 < cga> Discordian: see differences between application and process (ie. installed versus running)
 23:28 < ivazquez> rpm does not have a problem with the x86_64 package and the ix86 package containing the same
 files, provided that the 
 files are *exactly* the same.
 23:28 < ivazquez> Timestamp, contents, permissions, etc.
 23:28 < Discordian> cga: thank you
 23:28 < ivazquez> But even this is not true.
 23:28 < ivazquez> And this is where we get into trouble.
 23:29 < ivazquez> If the x86_64 package and the ix86 package are installed in the same "transaction",
 rpm is perfectly willing to overlook differences in the files and will install both packages, with the x86_64
 files overwriting the ix86 files.
 23:29 < ivazquez> HOWEVER.
 23:30 < ivazquez> If either one of the packages is already installed, and we try to install the other package,
 rpm will complain about file conflicts since they don't match.
 23:31 < ivazquez> This sort of issue has spawned many more bug reports than are deserved.
 23:31 < ivazquez> So, something to keep in mind when you install or build packages.
 23:32 < ivazquez> So, now that we understand (I hope) these 5 elements, let's look at what else a package's
 metadata provides.
 23:33 <@nirik> I have a question: where does rpm store the ix86 binaries? If the x86_64 package is removed
 it would have to place the ix86 ones back right?
 23:33 < ivazquez> It does not. It simply doesn't erase the files provided by the x86_64 package.
 23:33 < ivazquez> Yay multilib.
 23:34 <@nirik> yikes. ;(
 23:34 < Discordian> Does that mean you could rebuild both i386 and x64 rpms using rpmrebuild and they'd both
 be valid?
 23:35 < ivazquez> I don't know. I've never looked at how rpmrebuild works.
 23:35 < Discordian> My impression is it just uses the rpm db to reconstruct the rpm file
 23:36 < ivazquez> I'm glad you bring up the rpmdb.
 23:36 < ivazquez> A package in a file has a set of metadata.
 23:36 < ivazquez> When said package is installed, the metadata needs to be stored in order to retain
 information about the package.
 23:37 < ivazquez> The metadata is stored in what is called the rpmdb.
 23:37 < ivazquez> We can query the rpmdb via "rpm -q".
 23:38 < ivazquez> We pass it a name, and rpm will give us the metadata in a pre-programmed format.
 23:38 < ivazquez> Normally it looks like <name>-<version>-<release>.
 23:38 < ivazquez> I'm not sure if they've added .<arch> to that recently. I believe so though.
 23:39 < ivazquez> You can see an example by opening up a terminal and typing "rpm -q filesystem".
 23:39 < Discordian> That is the case on F9 yes
 23:39 < ivazquez> If someone could provide the output from theirs please?
 23:40 < Sid> filesystem-2.4.13-1.fc9.i386
 23:40 < ivazquez> Right, there we are.
 23:40 < ivazquez> The name is, of course, "filesystem".
 23:41 < ivazquez> The version is "2.4.13", the release is "1.fc9", and the arch is "i386".
 23:41 < EvilBob> [root@mediapc1 ~]# rpm -q filesystem
 23:41 < EvilBob> filesystem-2.4.13-1.fc9.i386
 23:41 < EvilBob> grrr
 23:41 < ivazquez> Sometimes we need more or less than is provided by the default format.
 23:41 < zcat> ah. so showing the arch is default now? i've been adding 
 "%_query_all_fmt  %%{name}-%%{version}-%%{release}.%%{arch}" to ~/.rpmmacros so i didn't notice
 23:41 < EvilBob> filesystem-2.4.11-1.fc8
 23:41 < Discordian> filesystem-2.4.13-1.fc9.i386 is what I have
 23:42 < ivazquez> Right, and we're get to query formats right now.
 23:42 < ivazquez> *getting
 23:42 < ivazquez> Let's say that you have a script that verifies the version of certain packages on
 your system.
 23:42 < ivazquez> You want to narrow down your focus to only the version.
 23:43 < ivazquez> Normally you'd do post-processing of the output of "rpm -q".
 23:43 < ivazquez> But the default output can be changed, as demonstrated by zcat just above.
 23:43 < Discordian> you mean using awk or perl?
 23:44 < ivazquez> Correct.
 23:44 < ivazquez> What you can do is you can pass rpm a parameter that tells it the format you want.
 23:44 < Discordian> cool I can see that would be useful
 23:44 < ivazquez> This argument is "--qf", and it takes the format in a special, err, format.
 23:45 < ivazquez> Let's deal with just the version for now.
 23:45 < ivazquez> To get the version of filesystem we run "rpm -q --qf '%{version}'".
 23:45 < ivazquez> This will return just the value, with no fancy formatting.
 23:46 < ivazquez> Naturally if the output will be consumed by a person we'll at least want to put it on its
 own line.
 23:46 < ivazquez> The query format string takes several C-style escapes, including \n for a newline.
 23:47 < nuonguy> you mean rpm -q filesystem --qf '%{version}' right?
 23:47 < ivazquez> "rpm -q --qf '%{version}\n'" will give us something that we can deal with easier.
 23:47 < ivazquez> Yes.
 23:47 < neverho0d> rpm -q --qf '%{version}\n' filesystem
 23:47 < ivazquez> Right.
 23:48 < ivazquez> There are a large number of these "query tags" built into rpm.
 23:48 < ivazquez> You can see a full list by running "rpm --querytags | less".
 23:48 < ivazquez> The query tags in the query format are not case-sensitive.
 23:48 < neverho0d> wow 163!
 23:49 < ivazquez> Most of them you'll never need though.
 23:49 < daMaestro> and shortcuts for those that don't want to type ;-)
 23:49 < ivazquez> Among the important ones are the 5 parts that we started with here:
 23:50 < ivazquez> name, epoch, version, release, arch.
 23:50 < ivazquez> Also important is sourcerpm, which gives you the name of the component in Bugzilla to file
 bugs against.
 23:51 < ivazquez> installtime is also useful.
 23:51 < ivazquez> But there are 2 things to know about it.
 23:51 < ivazquez> 1) It only makes sense in the rpmdb. Querying it from a package in a file makes no sense.
 23:51 < ivazquez> And we'll get to querying a package in a file in a moment.
 23:52 < ivazquez> 2) It gives its result as a *nix timestamp. Not exactly human-friendly.
 23:53 < ivazquez> So "rpm -q --qf '%{name}: %{installtime}\n' filesystem", while technically correct, isn't
 all that useful.
 23:53 < ivazquez> Fortunately query tags support a suffix that you can add to modify them.
 23:54 < ivazquez> The suffix ":date" will convert a timestamp into a human-readable date and time.
 23:54 < ivazquez> So with a small change we get "rpm -q --qf '%{name}: %{installtime:date}\n' filesystem",
 and we can see more clearly when our system was installed or upgraded.
 23:56 < ivazquez> Another query tag of interest is dsaheader.
 23:56 < ivazquez> This tag tells us how, when, and who signed a package.
 23:56 < ivazquez> We'll get into signing packages a bit later.
 23:57 < Discordian> Cool, this is good stuff
 23:57 < cga> night all, this is interesting but i'm falling asleep. i'll read the logs tomorrow.
 i find the classroom idea very interesting. kudos.
 23:57 < ivazquez> Take care.
 23:57 < ivazquez> Again, it spews a long hex string that will make our brains asplode if we try to consume it.
 23:57  * nirik didn't know the :date stuff. very cool. 
 23:58 < ivazquez> Suffixing it with ":pgpsig" processes it and gives us a result we can use.
 23:58 < daMaestro> yes, i just wasted a few moments trying to pipe the unix timestamp to data -d via xargs
 23:58 < daMaestro> thanks.
 23:58 < kdn> Thanks to all.  This was a Good Thing(tm).
 23:58  * delhage agrees
 23:58 < ivazquez> rpm -q --qf '%{name}: %{dsaheader:pgpsig}\n' filesystem
 23:58 < daMaestro> it's not over
 23:58 < Discordian> I'm not going
 23:58 < ivazquez> I'll leave it as an exercise for the reader as to whose key that is ;)
 23:59 < Discordian> heh
 23:59 < brunowolff> How well do you have to know the code to package something from upstream?
 23:59  * nirik notes that this is the last class scheduled today... more classess tomorrow. 
 23:59 < ivazquez> Almost not at all.
 23:59 < domg472> why --qf and not -qf, would that conflict?
 00:00 < brunowolff> So a good working relationship with upstream would be good enough?
 00:00 < ivazquez> -qf is seen as a combination of 2 flags, -q and -f.
 00:00 < domg472> i see
 00:00 < Discordian> -qf is which package does this file come from
 00:00 < ivazquez> brunowolff: Not even. There are a number of instances where upstream is not even aware
 of it.
 00:00 < ivazquez> Okay, packages on disk.
 00:00 < Sid> is it possible to query packages that aren't installed?
 00:01 < ivazquez> Yes it is.
 00:01 < ivazquez> There isn't much to say about packages on disk, other than you need to add -p to rpm.
 00:01 < ivazquez> And of course, some tags such as installtime don't make sense.
 00:01 < brunowolff> Is there a good place to get info on how to package java apps? I have one I build from
 ant locally, but I don't know where fedora wants
 00:02 < ivazquez> After all, a package on disk isn't considered to be installed.
 00:02 < brunowolff> java apps placed or if there are any particular build practices to use.
 00:02 < neverho0d> ivazquez: it would be helpful to explain a package breaking on docs, devel and other things....
 00:02 < Sid> ok, what if the package isn't on disk but simply in the yum repo, can I then query it?
 00:02 < ivazquez> brunowolff: Fedora has an extensive set of packaging guidelines in the wiki.
 00:02 < Sid> or do I at least need to download the .rpm?
 00:02 < ivazquez> Another excellent question.
 00:02 < ivazquez> neverho0d: We'll handle that in a bit.
 00:03 < ivazquez> Packages in a repo.
 00:03 < neverho0d> ivazquez: ok. thnks
 00:03 < ivazquez> yum-utils has a script, repoquery, which can be used to query packages in a repo.
 00:03 <@nirik> brunowolff: https://fedoraproject.org/wiki/Packaging/Java
 00:04 < ivazquez> It takes a subset of the query tags that rpm does, due to the fact that the repo metadata has
 only a small fraction of the package metadata.
 00:04 < ivazquez> Other than that it takes -q and --qf just like rpm.
 00:05 < ivazquez> Alright, I think I've chewed up enough time on my bit. Which topics did I say I'd defer?
 00:05 < brunowolff> nirik: Thanks that looks pretty readable. I'll see if I can make it work locally. There are
 other issues with packaging the app (Colossus) that I am interested in.
 00:05  * nirik is sad that it doesn't provide sourcerpm. ;( 
 00:05 < domg472> thanks
 00:05 < domg472> very informative day
 00:06 < ivazquez> Package breakup.
 00:06 < Discordian> Thank you very much
 00:06 < erinlea80> Thanks ivazquez :) Its been informative.
 00:06 < Discordian> I learned a lot
 00:06 < ivazquez> Not quite done yet, but I understand if you have other things to do.
 00:06 < mattia> thanks ivazquez
 00:06 < Discordian> No no I'll stay
 00:06 < Bugz> thanks ivazquez
 00:06 < ivazquez> Part of the metadata in a package is what "type" a file is.
 00:07 < ivazquez> That is to say, whether it's a normal file, a configuration file, or documentation.
 00:08 < Discordian> or a script?
 00:08 < ivazquez> rpm allows you to query only these specific files by passing -c or -d, but most of the query
 tags are at the package level and so don't apply.
 00:08 < ivazquez> Package scripts are actually stored separately.
 00:08 < ivazquez> They're in the metadata, not the files.
 00:08 < Discordian> Okay I've wondered about that
 00:09 < ivazquez> They're viewable as either any of the *prog query tags, or by passing --scripts when querying
 a package.
 00:09 < domg472> rpm -q --scripts
 00:09 < Discordian> Thank you
 00:09 < ivazquez> No, I lie. Not the *prog tags.
 00:10 < ivazquez> Those are what is used to actually run the scripts.
 00:10 < ivazquez> The scripts themselves are in the prein, postin, preun, postun, and other related tags.
 00:11 < ivazquez> Okay, I can't seem to find what other audience topics I decided to defer.
 00:11 < ivazquez> Does anyone have any questions?]
 00:12 < domg472> it was all clear except the x86 x86_64 bit but that must be my fault
 00:12 < nuonguy> is there a virtual file system like /selinux or /proc to browse the rpm database?
 00:12 < domg472> ill digginto that myself
 00:12 < Discordian> since this is Fedora : what impact will the new version of rpm have?
 00:12 < nuonguy> domg472: +1
 00:12 < Discordian> or is that unfair?
 00:12 < ivazquez> No. The rpmdb is stored as a set of bdb or sqlite databases in /var/lib/rpm.
 00:13 < erinlea80> My questions were answered up above.
 00:13 < ivazquez> Which one, rpm5?
 00:13  * nirik muses that someone could make a fuse-rpmdb someday... 
 00:13 < Discordian> Yeah
 00:13 < domg472> by the way nuonguy id like to hear what is your reason to contribute to fedora or oss
 in general?
 00:14 < ivazquez> rpm5 is an independent project. I cannot speak for Red Hat, but it has little or no
 impact in Fedora as far as I can see.
 00:14 < ivazquez> the rpm maintainers in RH could probably give you a more complete answer on that one.
 00:15 < Discordian> Okay I noted that F10 had flagged up a new version of rpm
 00:15 < Discordian> I think perhaps it's 4.x
 00:15 < ivazquez> That would be further development in the 4.x branch.
 00:15 < Discordian> Yes
 00:15 <@nirik> yes, 4.6.0
 00:15 < sdodson> F10 will use 4.6.
 00:15 < sdodson> Though, I believe it's been requested that newer features of 4.6 not actually be used until
 a later date.
 00:16 < Discordian> Ahhh cool
 00:16 < ivazquez> Anyone else?
 00:16 <@nirik> thanks ivazquez. Great session!
 00:16 < ivazquez> In that case, thank you for coming, and thank you for listening.
 00:17 < Discordian> yes thank you very much ivazquez
 00:17 < nuonguy> domg472: when I interview candidates, I ask them if they contribute to any open source projects
 00:17  * erinlea80 applauds
 00:17 < neverho0d> ivazquez: may be one another class om spec-file building? ;)