Hardly obscure, every modern OS has an equivalent feature, but only OSX and Wind...

amyjess · on July 20, 2016

> feature that almost nobody uses because it has a shitty non-file based API that also breaks most tools unless they are specifically aware of them: extended attributes

Mind you, OS X makes extensive use of extended attributes in addition to resource forks (and it's largely deprecated resource forks in favor of app folders). Spend some time poking around Siracusa's reviews (since Tiger); he loves to go into detail about every new way Apple makes use of extended attributes.

Also, it's not fair to say that almost nobody uses them. Chrome makes use of extended attributes, as does KDE's metadata system and a few other things.

> (limiting their size to strings that will fit in RAM)

That's an understatement. The Linux kernel API limits the size of all extended attributes to 64KB, and the most popular filesystems limit them further to 4KB. That's not really comparable to a true fork.

ZFS is the exception: its extended attributes are implemented as forks, and the maximum size of an extended attribute is the same as that of a file. Unfortunately, those aren't accessible on ZOL because the kernel won't support it, so you can really only take advantage of it on Solaris/Illumos (and maybe FreeBSD?).

variant · on July 20, 2016

Does OSX use extended attributes to track things like the "color" attribute on a file (that shows up in Finder)? Or is this tracked via the .DS_Store hidden file?

0x0 · on July 20, 2016

It uses XAs, check with:

  mdls some-file-tagged-green
  (...)
  kMDItemUserTags                = (
      Green
  )

or with xattr:

  xattr -l some-file-tagged-green
  (...)
  com.apple.metadata:_kMDItemUserTags:
  00000000  62 70 6C 69 73 74 30 30 A1 01 57 47 72 65 65 6E  |bplist00..WGreen|

swift · on July 20, 2016

I believe the .DS_Store files are just a fallback so if you are accessing files on a network share or on a file system that doesn't support extended attributes like FAT32, those features can still work. The native implementation on HFS+ uses the extended attributes.

0x0 · on July 20, 2016

.DS_Store is where finder saves the folder ui configuation (icon positions, list mode, etc) and these files are present even on native HFS+ (although "hidden" by default like any other unix file name starting with ".")

There were some reports online that the future APFS in the 10.12 betas didn't leave .DS_Store files around.

rwmj · on July 20, 2016

You missed:

- Unix xattrs have a terrible API and awful command line tools: listxattr(2) returning \0-separated character arrays with lists of attributes that are next to impossible to decipher in C? - check! Hiding certain xattrs by default based only on their names? - check!

- xattrs have magical qualities based on their names, the kernel version, the kernel configuration, and the filesystem mount options (eg. "security.selinux", "trusted.*")

- Some xattrs are \0 terminated (and the APIs set and return the \0 making them very awkward to use from shell scripts), some don't, and some are indeterminate. They can also be binary blobs.

ivank · on July 20, 2016

Also, add too many xattrs and you can no longer get a list of them:

       As noted in xattr(7), the VFS imposes a limit of 64 kB on the size of
       the extended attribute name list returned by listxattr(7).  If the
       total size of attribute names attached to a file exceeds this limit,
       it is no longer possible to retrieve the list of attribute names.

http://man7.org/linux/man-pages/man2/listxattr.2.html

nailer · on July 21, 2016

Is \0 ASCII null?

rwmj · on July 21, 2016

Yes. For more details see http://man7.org/linux/man-pages/man2/listxattr.2.html

Dylan16807 · on July 20, 2016

I think if "almost nobody uses it" it's fair to call it obscure.

> another case where Windows is more UNIX than UNIX

Windows has extended attributes too. Having both features makes it more like a kitchen sink.

chris_wot · on July 20, 2016

It's used in lots of places. Internet Explorer uses it to save whether a file was downloaded via IE. They are just only useful on NTFS, and often not even then because file hating utilities like Dropbox don't store them. So if you upload a file with an ADS to Dropbox, then copy it back again, you'll have lost that data.

One thing I'm not sure about is whether it appears in the file size when using dir. And if you apply a file hashing algorithm to generate a hash and you only use the file attributes, base file name and $DEFAULT data stream then you can append to the file via another data stream. So hash utilities need to be AFS aware to be truly useful in Windows.

Unless you are calling data an "attribute" though, it's really a bit of a silly comparison. Literally it's a seperate namespace in which you store data. The standard tools and utilities provided by Windows generally only look at $DEFAULT. The article is correct, git is pretty much doing something very similar, only the data is stored in .git (or specified somewhere else) and you use a tool like git to get access to that data, but you can also dive into the directory directly with any other tool. In Windows you use streams.exe, and it's a. generalised, b. non-portable as it's an intrinsic part of NTFS, and c. denoted as part of the NTFS filename by the delimiter ":", which is a reserved character and documented as such.

https://blogs.technet.microsoft.com/askcore/2013/03/24/alter...

besselheim · on July 20, 2016

Using dir normally does not display anything related to the alternate data streams, but if you use the /r option, their names and sizes will be shown.

kristianp · on July 20, 2016

Speaking of dropbox, it adds its own attributes, called com.dropbox.attributes with 83 bytes of binary data, according to 'dir /r'.

mbakke · on July 20, 2016

> Internet Explorer uses it to save whether a file was downloaded via IE.

Wait, but why? Chromium (at least on Linux) uses extended attributes too, but to record the origin and referrer of downloaded files (which can be really useful, once you know about it).

hug · on July 20, 2016

Chrome does it too. As does outlook, Firefox, possibly a bunch of other things. I think you'll find that the stream is zone-identifier. It can contain a value of 1 to 4, where each corresponds to a list of Windows' security zones. (Restricted sites, internet sites, local Intranet, and trusted sites from 4 to 1 respectively. There's a fifth option, zone 0, which is "local computer", but it's unused.)

This is the source of the prompts in Windows that say "this file came from the Internet, are you sure you wish to run it?".

int_19h · on July 20, 2016

To be more specific, IE (and most browsers on Windows, actually) use alternate streams to record that the file originates from the network, in a certain standardized way. When such a file is an executable file, and the user attempts to launch it (via Explorer; I don't think this happens for command line), they will get a confirmation dialog from the OS telling them that it's unsafe.

Other applications can perform similar checks on file formats that they handle, if the payload can be dangerous when untrusted. E.g. Visual Studio will give you a warning if you're trying to open a project file with this bit set.

ksherlock · on July 20, 2016

Solaris unfies it too. You can even use the runat command to open a shell where extended attributes are exposed as and can be manipulated as regular files.

http://docs.oracle.com/cd/E23824_01/html/821-1474/fsattr-5.h...

wfunction · on July 20, 2016

NTFS has extended attributes.

vocatus_gate · on July 21, 2016

How....do you know this kind of stuff? Great read, thanks.