<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-7306726567188713627</id><updated>2012-02-16T14:52:24.360-05:00</updated><title type='text'>CyPhy - Cybertaxonomy and Phylogenetics</title><subtitle type='html'>Mostly harmless pointing at things pertaining to cybertaxonomy and phylogenetics</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>18</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-4812340483159870428</id><published>2011-02-24T00:08:00.001-05:00</published><updated>2011-02-24T00:34:09.421-05:00</updated><title type='text'>You are in a maze of twisty little passages, all alike.</title><content type='html'>I poked at &lt;a href="http://twitter.com/#!/rdmpage"&gt;@rdmpage&lt;/a&gt; the other day pointing out that he should think about a 3D phylogenetic tree browser rather than thinking in 2D. Navigating 3D environments is very well worked out thanks to the multi-billion dollar gaming industry, it seems like one could borrow some of the concepts there and map them to viewing phylogenies.&lt;br /&gt;&lt;br /&gt;After a little back and forth I decided to do a brain dump based on some ideas that were discussed at the recent &lt;a href="http://gmod.org/wiki/GMOD_Evo_Hackathon"&gt;GMOD hackathon&lt;/a&gt; about linking phylogenies to genome tracks. &lt;a href="http://twitter.com/#!/hlapp"&gt;@hlapp&lt;/a&gt; chimed in that this might lead to a career in phylogame (tm?) programming. This resulted in the exercise taking a little more of a fantasy-role than first anticipated, but I think some of the original ideas might still come through. I don't claim that this solves Rod's problem (nor perhaps any real problem), but nevertheless here it is, an A-Z list of possible features for a 3D tree browser using a 3D-game-engine. Such a world might also act as a sort of &lt;a href="http://www.nytimes.com/interactive/2011/02/20/magazine/mind-secrets.html"&gt;memory palace&lt;/a&gt;. The major thing that's missing- some way to earn "points", because that's what people do in games, get points, so they must be important. If you have to ask- no I do not have a World of Warcraft account, I did however, beat &lt;a href="http://en.wikipedia.org/wiki/Akalabeth:_World_of_Doom"&gt;Akalabeth&lt;/a&gt;.  The graphic below in no way attempts to represent actual phylogenetic relationships, click to enlarge it.  If someone wants the illustrator file to modify I'll gladly pass it along.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-NDlq6dmBdNA/TWXkt5QWQfI/AAAAAAAAASk/skt5IgE5-r8/s1600/3d_tree_browsing.jpg" imageanchor="1" style=""&gt;&lt;img border="0" height="300" width="400" src="http://1.bp.blogspot.com/-NDlq6dmBdNA/TWXkt5QWQfI/AAAAAAAAASk/skt5IgE5-r8/s400/3d_tree_browsing.jpg" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;A - The world consists of a series of paths based on the tree(s) in question.  Visible paths are rendered in the ground plane.  As the user moves along the paths towards the horizon we see more paths, and the features on the horizon are updated to reflect the remaining tips of the tree.&lt;br /&gt;B - In the horizon pane we see a generalized summary of the clades (paths) we might follow if we continue forward on that path.  Much meta-data can be mapped to these features, and the surrounding environment.&lt;br /&gt;C - Bigger features on the horizon (trees?) indicate more paths, giving the user a sense of where the additional information content is.&lt;br /&gt;D - Trees can differ in their health (color of leaves), healthier trees are those that have clades that are supported by more data, or perhaps higher support values.&lt;br /&gt;E - A forest of features might indicate some measure of complexity of the clades in question, or maybe the total number of clades.&lt;br /&gt;F - The weather on the horizon (or above), indicates controversy with the clades in question, or perhaps uncertainty (clouds = doubt).&lt;br /&gt;G - Horizon features need not be trees, they might vary based on some parameter- mountains are rock solid clades.&lt;br /&gt;H - Like the original TOL &lt;a href="http://www.tolweb.org/tree/home.pages/treehouses.html"&gt; tree houses &lt;/a&gt; you could stop in to learn more about the clades/taxa in question.&lt;br /&gt;I - Trusty map radar.&lt;br /&gt;J - Glowing purple portals let you exit to other websites with context-dependent information.&lt;br /&gt;K - Are there genomes for the clade in question?  Lamps, breadcrumbs or other features illuminate parallel paths.&lt;br /&gt;L - Pubs indicated nodes where social-media are discussing the clade in question, pop in for a virtual drink.&lt;br /&gt;M - People are authors, a crowd at a node corresponds to many unique authors.&lt;br /&gt;N - Click to interact.&lt;br /&gt;O - Lateral planes could alternatively map to other metadata. In a 2D system (moving only forward and back) looking left or right could bring up charts/graphs, images or other metadata.&lt;br /&gt;P - Paths could turn to springs, then water courses, for aquatic clades.&lt;br /&gt;Q - For the kids- BATTLE DINOSAURS!&lt;br /&gt;R - Author avatars greet you at the trees they have published.&lt;br /&gt;S - Synapomorphies are statues, monuments or other features along the road.  Examine them to learn more. &lt;br /&gt;T - Many trees are contradicted in places, swamps join clades to illustrate alternative arrangements.&lt;br /&gt;U - Fruit on your tree?  Someone has left comment(s) or feedback. &lt;br /&gt;V - Strange auras?  Multiple equally optimal trees exist.&lt;br /&gt;W - Use the magic wand to toggle through things with auras.&lt;br /&gt;X - Paths can be paved based on the underlying data supporting that clade (gold = morphology, cobblestone = single copy nuclear genes, mud = CO1).&lt;br /&gt;Y - Paths to terminal nodes end with a &lt;a href="http://phylopic.org/"&gt; picture &lt;/a&gt;.&lt;br /&gt;Z - Look up! Constellations form alternative phylogenies.&lt;br /&gt;&lt;br /&gt;Thanks to &lt;a href="http://phylopic.org"&gt;PhyloPic&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-4812340483159870428?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/4812340483159870428/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=4812340483159870428' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/4812340483159870428'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/4812340483159870428'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2011/02/you-are-in-maze-of-twisty-little.html' title='You are in a maze of twisty little passages, all alike.'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-NDlq6dmBdNA/TWXkt5QWQfI/AAAAAAAAASk/skt5IgE5-r8/s72-c/3d_tree_browsing.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-4125933588300345888</id><published>2009-11-14T00:23:00.003-05:00</published><updated>2009-11-14T00:24:20.922-05:00</updated><title type='text'>Longevity, agility, and standards in biodiversity informatics</title><content type='html'>This post is mixed bag of equal parts #tdwg09 tweets response, clarification of the philosophy behind mx, and end-of-the-week-flu-induced rant.  I really don't mean it to be an argument for using mx vs. say, Scratchpads (that's sort of an apples-oranges argument). I've always been &lt;a href="http://hymenoptera.tamu.edu/wiki/index.php/Why_use_mx"&gt;very upfront&lt;/a&gt; in my cautioning potential users as to the pitfalls of using mx.   I'm very aware of the arguments for standardization, stability etc., however, what is also painfully clear is that there is far to little diversity in biodiversity applications, i.e. tools which enable scientists to do new science.  The worlds biodiversity is 10% described, and there are only so many ways we can mark up, search, map, and re-index this minuscule fraction of potential knowledge.  Where are the new &lt;span style="font-style: italic;"&gt;applications&lt;/span&gt; that are going to capture the remaining 90% of the data while making it &lt;span style="font-style: italic;"&gt;inherently&lt;/span&gt; more useful?&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Agility&lt;/span&gt;&lt;br /&gt;A couple of premises.  New science happens fast, really fast, to accommodate it we need "agile" (yes it's cliche) solutions.  The funded life of many projects happens within 1-3 years, and the notice of funding is abrupt. mx exists, in part, because  I need an environment that I can quickly modify if I want to test new things (&lt;a href="http://hymao.blogspot.com/"&gt;visualizations&lt;/a&gt;), promote new ways of thinking about taxonomy (morphological ontology-based descriptions), mock up proof of concepts for grants, or provide new functionality for my self or my research colleges.  I need to be able to provide features and functionality for active research immediately, as in &lt;span style="font-style: italic;"&gt;today&lt;/span&gt;.   The consequences of this type of approach is not always positive, among other things the probability of more features failing increases because they met too specific a need or just generally sucked is higher.  The benefit of providing many mutations is that the best (one-click matrix coding) are rapidly selected for.&lt;br /&gt;&lt;br /&gt;One practical example, poking at Scratchpads (because I think it's a great project) to make a point.  This year Scratchpads is reporting its Nexus importing functionality.  Last year mx needed to import Nexus formatted matrices and I justified the time to develop a general purpose parser and importer because we had people who would immediately be using the functionality.  The functionality and has been available for over a year.  This year we've added a more robust framework for handling sequence data (FASTA uploads automatically tying to specimen data) because I and my colleges will benefit greatly from it (we've already added hundreds of extract, pcr, and specimen records).  I have no doubt that "next year" Scratchpads will get FASTA support.   There is also no doubt that at some point Scratchpads, or some other uber project (e.g. Lifedesks, Lucid or Mesquite) will have all the functionality that anyone could ever &lt;span style="font-style: italic;"&gt;dream of right now&lt;/span&gt;.  At that point somebody will want to do something else, and it will take years for them to catch up and do that something else.   You get my point.  This definitely doesn't imply that mx will have everything a biodiversity app needs, nor that it is better, more quickly developed, more comprehensive, more reasonable or any such claim,  it's just an illustration of a very pragmatic issue that biodiversity apps in general need to address- we need tools and we need them now.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Standards&lt;/span&gt;&lt;br /&gt;Scientists will capture whatever data it is that is interesting to them, and waiting to do science because some standard is in place is a sure fire way to fail.  One example among many-  As various folks have tweeted (see #tdwg09, and if you dare read taxacom) there is some "discord" about the role of LSIDs in biodiversity informatics.  Had my first priority been to ensure that everything that mx did was LSID compliant I would have spent an inordinate amount of time on functionality that to date I'm still not sure is going to be useful.  The same goes for SDD, does anybody actually use this to do real work, and how might I find out if they did?   It might turn out that LSIDs are critical, in which case I believe that mx is perfectly situated to take advantage of them, and perhaps even provide them. &lt;br /&gt;&lt;br /&gt;I have very little fear that the data model used by mx is somehow incompatible with core biodiversity data. We've already provided or capture data in a large number of  formats including DarwinCore, Nexml, OBO, FASTA, Nexus, TNT, ITIS import tables, and have many mechanisms for exporting data for users. Since mx is open source  it's open to anyone (particularly those with ample experience- nudge nudge) to hack up new ways to expose or manage standards-compliant  data.   Getting data "out" will always be relatively easy, making data digital (getting it "in") is hard work that 1) doesn't get enough credit and 2) doesn't get enough funding.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Longevity&lt;/span&gt;&lt;br /&gt;mx originally evolved largely out of my desire to have a unified environment for my &lt;span style="font-style: italic;"&gt;research&lt;/span&gt; as a systematist.  It exists because at the time no solution existed that integrated the types of data I wanted to manage.  I maintain that no such solution (including mx) yet exists today.   It is meant to by my "life-long-companion".  If I get hit by a bus (better not go outside today on the 13th) will mx die? Probably.  Will the data be lost? If someone finds it useful then probably not. Is it in some cryptic silo that only I know of? No- source is on SF, numerous labs have vested interested in keeping their data pertinent.   Longevity should not get in the way of getting things done in the first place.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;mx&lt;/span&gt;&lt;br /&gt;mx is  evolving, and funding or not I see no reason why it can't continue to do so.   Because it does a lot of different things its role is somewhat foggy (harder to pin down and bash, I like it that way), but some central objectives are emerging:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Focus on functionality that allows researchers to do work.  When in doubt about standards capture verbatim data.  Worry about who to make friends with (which standards to adopt) when the apocalypse ends.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;mx is not a final repository for data (is there such a thing?). It is an environment for gathering and working with data.  When you publish your data your sequences should go to Genbank, your taxon names to Zoobank, your descriptions to EOL and plazi, your trees to Treebase, your specimen data to... oh hell just send it all to Rod Page.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Provide a tool that helps to generate new data that others want (the best way ensure longevity is to have useful data), then give others a way to access that data however they may want  (a flexible web-based framework like Rails)- they just have to hack on their preferred API.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;mx is not (just) about making taxon pages, taxon pages are somewhat of a nebulous concept that is best compiled by larger entities (EOL, ALA) with budgets for flash.  We've provided some of the basics to make pages, but we hope that these are stopgaps for an integrated approach with larger projects.  The multipurpose utility of an environment like mx is perhaps best exhibited by the &lt;a href="http://www.blogger.com/post-edit.g?blogID=812677668794382905&amp;amp;postID=268863725306378981"&gt;HAO&lt;/a&gt; which was rapidly prototyped in mx and then NSF funded.  The end product of the HAO is an OBO file which alone justifies the work on the application which may or may not exist in 5 years.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;mx is an expert system and it makes no apologies for being so.  Taxonomists, phylogenetists and ontologists are &lt;span style="font-style: italic;"&gt;experts&lt;/span&gt; that work with &lt;span style="font-style: italic;"&gt;complex data&lt;/span&gt; (that can't be handled by wikis...yet).  Just as we don't expect mathematicians to use the Microsoft Equation Editor, but rather to understand the complexities of software like Mathematica, and expect engineers to use CAD, we should expect people who study biodiversity to be expert informatics tool users.  Another way of saying this- you see nobody convincing math profs to do their work with wikis instead of Mathematica/R etc., there simply isn't the specialization in these applications to enable the data exploration and management that needs to be done (again, this is a big "for the foreseeable future" ... during which many grants will be written and much research will be done).  While mx wants things to be easy and straightforward as possible (it's likely failing miserably now) it is not aspiring to be the Microsoft Word of Biodiversity Informatics.&lt;/li&gt;&lt;/ol&gt;mx is open source largely for selfish reasons- I wanted to see if there are other systematists who hack (whether it is possible to do both at the same time is unclear and will be the topic of an upcoming post), and hoped for mutual benefits if such existed.  It's an experiment which I hope will slowly emerge as a successful one-  big matrices take a long time to compile, as do taxonomic catalogs, specimen inventories and complex ontologies.   These data in instances of mx are already significant (e.g. 85k &lt;span style="font-style: italic;"&gt;new&lt;/span&gt; matrix cells from over 1800 characters and 5400 character states, 7700+ OTUs,  40k references, 12k taxon names, 60k tags, an ontology of 2500 morphological terms etc.) and the commitment to using and funding mx suggest an exciting future.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-4125933588300345888?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/4125933588300345888/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=4125933588300345888' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/4125933588300345888'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/4125933588300345888'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2009/11/longevity-agility-and-standards-in.html' title='Longevity, agility, and standards in biodiversity informatics'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-6320767228550393062</id><published>2009-09-06T08:07:00.022-04:00</published><updated>2009-09-07T14:15:39.196-04:00</updated><title type='text'>Nomenclature is Dead! Long Live Pragmatic Taxa!</title><content type='html'>&lt;a href="http://www.hyam.net/blog/archives/598"&gt;Roger Hyam's blog post&lt;/a&gt; is great. A few somewhat random thoughts on it follow, with apologies in advance to Roger for any (likely) misinterpretations. I don't read taxacom, so there might be some redundancy here.  I'm not sure that what follows makes any sense, nor is it my personal position, that is, I'm trolling.&lt;br /&gt;&lt;br /&gt;What Roger seems to point out, and what I've argued to others in the past is that something like this will ultimately exist in the future:&lt;ol&gt;&lt;li&gt; In the future we will have &lt;span style="font-style: italic;"&gt;a lot&lt;/span&gt; of data on "the web".&lt;/li&gt;&lt;li&gt;These data will be &lt;span style="font-style: italic;"&gt;very&lt;/span&gt; accessible to machines OR humans (hereafter "mahus").&lt;/li&gt;&lt;li&gt;The way mahus will access &lt;b&gt;data of interest&lt;/b&gt; will be by using other &lt;b&gt;da&lt;/b&gt;ta/&lt;b&gt;o&lt;/b&gt;bservations,/&lt;b&gt;me&lt;/b&gt;asurements (hereafter lumped as "daomes").&lt;/li&gt;&lt;li&gt;Since taxonomic names are not daomes&lt;span style="font-style: italic;"&gt; they&lt;/span&gt;&lt;i&gt; are are pointless.&lt;/i&gt;&lt;/li&gt;&lt;/ol&gt;How does this work? Let's imagine some sci-fi:&lt;br /&gt;&lt;br /&gt;Situation A: The mahu plumber laying fibre optic cable is having a bad day. In the depths of its subterranean workplace something is fouling its locomotion.  The mahu applies a vacuum to the space itself sucking some of the offending stuff into a daome chamber.  Some daomes are taken, including a daome which identifies some hydrocarbon present only in life (at least according to the mahu's reference ontology).  Life has DNA, therefor the mahu's built in DNA reader makes another daome, which indicates that the offending stuff is &lt;a href="http://images.google.com/images?hl=en&amp;amp;um=1&amp;amp;sa=1&amp;amp;q=colony+soil+red+sting&amp;amp;aq=f&amp;amp;oq=&amp;amp;aqi=&amp;amp;start=0#start=0&amp;amp;imgc=specific&amp;amp;imgcolor=red"&gt;a red, stinging thing that lives in colonies in the soil&lt;/a&gt;, fouls up electrical equipment, and can be killed by a dose of the contents of the red canister attached to the side of its locomotory appendage. The pesticide is applied, the problem goes away, and the mahu continues plumbing.&lt;br /&gt;&lt;br /&gt;Situation B: Back on the surface two mahus are playing. Something hops onto the field in front of them and they want to know whether this thing might be dangerous and interfere with their game. They can see that this thing has &lt;a href="http://images.google.com/images?imgcolor=red&amp;amp;imgtbs=l&amp;amp;hl=en&amp;amp;sa=1&amp;amp;q=wing+beak+feather+song&amp;amp;aq=f&amp;amp;oq=&amp;amp;aqi=&amp;amp;start=0&amp;amp;imgc=specific"&gt;a wing, beak, feather, is red and is singing a song&lt;/a&gt; (I'm looking at one out my window now).  A few interactions with the cloud later they realize that they are looking at &lt;a href="http://www.allaboutbirds.org/guide/Northern_cardinal/id"&gt;something that is mostly harmless&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Both cases illustrate that we don't need to work with names at all in many cases.  Science would benefit immensely if we depended on daomes rather than names, since all our hypotheses and inferences would necessarily be tightly integrated with our primary daomes.  Among other things this would result in an excellent audit trail, as is required by good science.&lt;br /&gt;&lt;br /&gt;Note that this system only works if #2 above is (mostly) true.  Names are useful now because data are not accessible.  Names act as temporal place holders that allow us to jump from one reference to another without losing track of where we are.  If data are unified, linked, and very easily (=instantaneously) accessible we don't need these place holders to do work.&lt;br /&gt;&lt;br /&gt;Roger writes "If the purpose of taxonomy is to produce a system that people can actually use to hang data on - so that both people and machines can then infer  more knowledge from the linked data - then this is really the only game in town." I think that Roger doesn't want to do taxonomy, he wants to do &lt;i&gt;phenomics&lt;/i&gt;. He wants to produce a system of data that we can hang names on. This is a good idea.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Uncertainty&lt;/b&gt;&lt;br /&gt;&lt;hr /&gt;I have minor quibbles with Roger suggesting that DNA is the way to implement the necessary nomenclatory system, though this might be misunderstanding on my part.  If his system works then I think it can work with genomic or phenomic daomes. What I'm doing here is disagreeing, for the point of argument, with Roger's statement "What the machine can’t do is take into account additional properties, that weren’t considered in the first place, like the human user can.", and taking the discussion in a different direction.&lt;br /&gt;&lt;br /&gt;Roger's method works by instantiating a taxon and pointing to it with &lt;span style="font-style: italic;"&gt;hard link&lt;/span&gt;.  A hard link is something with two features, it is both an address and a property of the class itself. It's like using an id in a relational database as a meaningful property of the record it identifies, for instance using your SSN as a id for your medical records.  Creating hard link identifiers in a RDBMS is typically taboo. His &lt;i&gt;hard link&lt;/i&gt; is a string of letters, which just happens to be a DNA barcode, which is a property of an instance of his taxon. Note that &lt;span style="font-style: italic;"&gt;hard links&lt;/span&gt; could be anything, it need not be a string of letters referring to a DNA barcode, it could be "blue".  "Blue" the string of letters is unique, it's not "BlueRed" or "RedBlue", and it's also a property of the (polyphyletic) taxon that is blue.&lt;br /&gt;&lt;br /&gt;There is another subtle aspect to these identifiers. &lt;a href="http://images.google.com/images?hl=en&amp;amp;source=hp&amp;amp;q=blue+cows&amp;amp;gbv=2&amp;amp;aq=f&amp;amp;oq=&amp;amp;aqi=g3"&gt;"Blue" is obviously not unique as a property&lt;/a&gt;, and "ACGTG" (of suitable length) is. How about 475.0nm, the wavelength of blue light?  That's as unique as some long string of "ACGTG".  Thou taxon shall be identified by 475.0, not 475.1, nor 475.2, 457.3 is right out (or something like that, my &lt;a href="http://en.wikipedia.org/wiki/Monty_Python"&gt;MP&lt;/a&gt; is not so good).  My taxon is &lt;span style="font-style: italic;"&gt;Rhododendron luteum&lt;/span&gt;[475.0], Roger's is &lt;span style="font-style: italic;"&gt;Rhododendron luteum&lt;/span&gt; [ACGTG...].&lt;br /&gt;&lt;br /&gt;Going from "Blue" to "475.0" is really not that hard, it's just a matter of using a reference ontology to link the two (well, ok, that's a little tricky). I mean "blue" as in &lt;span style="font-style: italic;"&gt;this ontology&lt;/span&gt;. This brings up a slightly more abstract possibility.  What if within &lt;span style="font-style: italic;"&gt;this ontology&lt;/span&gt; we include logically consistent classes that are compositions which reference complex phenotypes.  As required by Roger's system our hard link both identifies a taxon, and is itself unique. Our identifier might look like this: "headsbluespinesverylongeyesredandsomeothertextthatjusthappenstodescribemytaxonandmakeslongstring". In the same way we might see issues with using "blue", we can instead point to something more specific in our reference ontology, so our human label just becomes some identifier, which references the class in the ontology, which logically defines our very long string.&lt;br /&gt;&lt;br /&gt;The problem with using blue or phenomic characters is that the phenome ontology doesn't exist, and will be difficult to make. This shouldn't prevent us from trying.&lt;br /&gt;&lt;br /&gt;The real issue with Roger's system is pragmatic.  It is that it requires that [A_label] points to a method(s) which return a daome which is a hard link of the class [A_Label].  In other words, the labels "475.0", "ACGTG" and "headsbluespinesverylongeyesredandsomeothertextthatjusthappenstodescribemytaxonandmakeslongstring" are &lt;span style="font-style: italic;"&gt;meaningless without context&lt;/span&gt;, and, unfortunately, context brings with it uncertainty. How exactly do I generate the daomes that also act as hard links?  You likely require YAO (yet another ontology) which documents an experiment which when performed returns a daome which has some &lt;span style="font-style: italic;"&gt;reasonable chance&lt;/span&gt; of being a hard link of the type you want. Since uncertainty is necessarily invoked when someone generates daomes of any type Roger's nomeclature using genomic daomes is no more valid from an ontological standpoint than is nomenclature using phenomic daomes. You can never completely be sure you are returning the string of letters that the label [atpH] is meant to return.  This is true for any number of reasons, &lt;a href="http://www.pnas.org/content/105/36/13486.abstract"&gt;many of them well documented&lt;/a&gt;, many just due to the nature of making observations.  The leap from label, to daome, is huge.  Note that what Roger wants, a completely logically consistent ontology and all the goodness that comes with it,  can't just mostly work (as he notes), it has to completely work, or it completely collapses. If we are OK with a system that "mostly works", then we should be OK with either phenomic or genomic approaches. One last way of saying this- it's all well and good if the ontology is completely internally consistent, but if I have to guess at whether my external data match the ontology, then guessing about morphology is as equally valid as guessing about DNA.&lt;br /&gt;&lt;br /&gt;What typically happens in this point of the argument is that the issue becomes a pissing-match about how accurate we want to be (e.g. 80% correct identification),  and whether or not genome out performs phenome in this regard.  This is of course an argument of method, not nomenclature, so it's changing the topic.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Pragmatic Taxa&lt;/span&gt;&lt;br /&gt;&lt;hr /&gt;There are obviously lots of other uses for names, I don't see them going away anytime soon.  One way to perhaps encourage (force?) people to think about the inverted process Roger is promoting would be to have journal editors require a statement of purpose with each new taxon circumscription.  If you want to provide a new name, you have to provide a statement indicating what your taxon circumscription could be used for, i.e. make it pragmatic.  The only rule is that the statement can not be "If you use my description then you can identify more of my taxon." That's a given.  Statements could be something along the lines of providing the species concept definition you're using but more specific.  It should come in the if-then form.  If variation in the wing length of my taxon is studied then we can say something about flight dynamics. One fallout of this requirement is that there would quickly become several accepted "stock" answers, one of which might be "If you use the characters I discuss, which are all referenced in an &lt;a href="http://hymao.org/"&gt;ontology&lt;/a&gt;, then a mahu can reason over your results and mine."  A nice way of getting data integrated into ontologies.&lt;br /&gt;&lt;br /&gt;p.s.  I purposely used the nonsense "daomes" and "mahus" not to confuse but rather to see if I can trace their reference in future discussion.  Note also how well Google does with a few completely general terms in it's image search.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-6320767228550393062?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/6320767228550393062/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=6320767228550393062' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/6320767228550393062'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/6320767228550393062'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2009/09/nomenclature-is-dead-long-live.html' title='Nomenclature is Dead! Long Live &lt;i&gt;Pragmatic&lt;/i&gt; Taxa!'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-457881441008130994</id><published>2009-05-15T23:38:00.004-04:00</published><updated>2009-05-15T23:58:29.560-04:00</updated><title type='text'>wolfram alpha + taxonomy = ?</title><content type='html'>So whenever a new search engine is introduced I like to test it with the family of wasps I specialize on, the Diapriidae.  &lt;a href="http://www.wolframalpha.com/"&gt;Wolfram|alpha&lt;/a&gt; is getting a lot of hype lately, and tonight it went live.  While it largely choked and acted oddly (if it worked at  all) on my search terms, some searches did appear to complete.  I &lt;a href="http://www.wolframalpha.com/input/?i=Diapriidae"&gt;tested the term Diapriidae&lt;/a&gt;, needless to say it didn't do too well as parasitic wasps are not dinosaurs.  I suppose it gets some points for recognizing I was requesting something related to a taxonomic classification...maybe. "Hymenoptera" was more successful, but with (very) minimal results.  "total species of hymenoptera" could not be interpreted.  Searching for &lt;a href="http://www.wolframalpha.com/input/?i=taxonomy"&gt;taxonomy&lt;/a&gt; returns some interesting results, including "taxonomic networks" (&lt;a href="http://www.wolframalpha.com/input/?i=Osmunda+regalis"&gt;e.g.&lt;/a&gt;).  You can click to see the sources for a given result, and those sources are (nicely) linked,  Species 2000/ITIS is listed as one.  It doesn't appear that you can drill down much past orders of insects.  In general I'm very underwhelmed thus far.  The primary source is listed as Wolfram|Alpha curated data, it will be interesting to see if they expand there database, watch for job postings cybertaxonomists!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-457881441008130994?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/457881441008130994/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=457881441008130994' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/457881441008130994'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/457881441008130994'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2009/05/wolfram-alpha-taxonomy.html' title='wolfram alpha + taxonomy = ?'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-3769821256371816399</id><published>2009-01-11T00:21:00.007-05:00</published><updated>2009-01-11T00:54:19.460-05:00</updated><title type='text'>augmented reality</title><content type='html'>&lt;object width="425" height="344"&gt;&lt;param name="movie" value="http://www.youtube.com/v/Iu6dXmbgF44&amp;amp;hl=en&amp;amp;fs=1"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;embed src="http://www.youtube.com/v/Iu6dXmbgF44&amp;amp;hl=en&amp;amp;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="344"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;This technology is pretty cool, and &lt;a href="http://www.youtube.com/watch?v=g8Eycccww6k"&gt;seems to have been around for a while&lt;/a&gt; (see also &lt;a href="http://www.youtube.com/watch?v=FqjSz-5P-a0"&gt;this more recent application&lt;/a&gt;).  I'm not sure how or what it has to do with cybertaxonomy, but it seems like it should have some application.  Perhaps the technology could be used by including the "fiducial marker" along with barcodes (paper) that are attached to mounted specimens.  By waving a device over a drawer of specimens one would get images (magnifications), meta-data, or some other cool or useful information.  Maybe you could build a physical tree, with something like &lt;a href="http://www.meccano.com/"&gt;meccano&lt;/a&gt; or &lt;a href="http://lego.com/"&gt;lego&lt;/a&gt; pieces "enhanced" with these markers, the relationships among the physical branches could be interpreted in the augmented reality, perhaps mapping character state transitions onto the physical tree, overlaying geographic distributions, or some such.  This could also be a great way to get kids into museums.  First, at sites (schools?) away from the museum, hand out "game" cards each with some information on an organism and an aforementioned marker.  These cards could be brought to a museum that housed an augmentation system.  There once "invisible" information would be revealed, for example movies, 3d representations, or pointers to where real live versions in the museum could be found.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-3769821256371816399?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/3769821256371816399/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=3769821256371816399' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/3769821256371816399'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/3769821256371816399'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2009/01/augmented-reality.html' title='augmented reality'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-6442270369711799281</id><published>2008-09-21T20:00:00.008-04:00</published><updated>2008-09-21T21:56:08.767-04:00</updated><title type='text'>ping (visualization)</title><content type='html'>Popping up to note that from Rod Page's post on the &lt;a href="http://iphylo.blogspot.com/2008/09/postphylogenetics.html"&gt;recent Nascent meeting&lt;/a&gt; it appears that visualization is a hot topic. If you aren't following Moritz Stefaner's  &lt;a href="http://well-formed-data.net/archives/158/maeve"&gt;well formed data blog&lt;/a&gt;, and you are interested in visualization, then you are really missing out. His &lt;a href="http://well-formed-data.net/archives/158/maeve"&gt;latest post&lt;/a&gt; brings up all sorts of possibilities for phylogenetic or taxonomic implementations. For kids or students, why not print up a &lt;a href="http://www.blogger.com/www.wizards.com/magic/"&gt;collectible card game&lt;/a&gt; &amp;lt;cough&amp;gt;was a long time ago&amp;lt;/cough&amp;gt; with a biodiversity theme, embed interesting data in the cards, and marvel as phylogenetic relationships  (or ecological etc.)  appear when the cards are placed on that &lt;a href="http://portal.mace-project.eu/maeve/"&gt; &lt;span style="font-weight: bold;"&gt;ridiculously cool table&lt;/span&gt;&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-6442270369711799281?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/6442270369711799281/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=6442270369711799281' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/6442270369711799281'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/6442270369711799281'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2008/09/ping-visualization.html' title='ping (visualization)'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-9180674547016062684</id><published>2008-04-30T23:25:00.006-04:00</published><updated>2008-05-01T00:14:58.256-04:00</updated><title type='text'>Relationalizing Nexus files with Ruby and mx</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_nUND2t4xLRE/SBk80nnwGkI/AAAAAAAAABw/bTgFODoZKBY/s1600-h/mesquite2mx.jpg"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer;" src="http://2.bp.blogspot.com/_nUND2t4xLRE/SBk80nnwGkI/AAAAAAAAABw/bTgFODoZKBY/s400/mesquite2mx.jpg" alt="" id="BLOGGER_PHOTO_ID_5195250519835155010" border="0" /&gt;&lt;/a&gt;     Ok, so "relationalizing" isn't really a word, but I kind of like how it sounds.  For the past couple of weeks I've been writing a &lt;a href="http://mesquiteproject.org/"&gt; Mesquite &lt;/a&gt; (i.e. Nexus)  file parser in Ruby.  It uses the same basic lexer/parser engine that reads Newick formatted trees that I mentioned in a previous post to create a Ruby Nexus file object, with all the good bits (well most of them, some blocks are not parsed yet, but that's just a matter of extending the parser) easily accessible from the object.  With this Nexus file object it was relatively trivial to write a conversion to &lt;a href="http://hymenoptera.tamu.edu/wiki"&gt;mx&lt;/a&gt; (see fig.), i.e. a fully relational format.   &lt;br /&gt;&lt;br /&gt;The Ruby file parsing code is currently a plugin/library in the mx source, it can be easily extracted for use in other projects.  Look for the code in mx 0.2.1540 and onwards when it makes it to Sourceforge, or contact me directly if you're really keen to get your hands on it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-9180674547016062684?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/9180674547016062684/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=9180674547016062684' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/9180674547016062684'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/9180674547016062684'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2008/04/relationalizing-nexus-files-with-ruby.html' title='Relationalizing Nexus files with Ruby and mx'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_nUND2t4xLRE/SBk80nnwGkI/AAAAAAAAABw/bTgFODoZKBY/s72-c/mesquite2mx.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-2450336426732493624</id><published>2008-04-01T21:46:00.005-04:00</published><updated>2008-04-02T23:07:54.316-04:00</updated><title type='text'>Plasto-types?</title><content type='html'>&lt;a href="http://it.slashdot.org/it/08/04/01/2257244.shtml"&gt;Slashdot&lt;/a&gt; and the &lt;a href="http://news.bbc.co.uk/2/hi/science/nature/7324564.stm"&gt;BBC&lt;/a&gt; have picked up a story on the use of X-ray Radiography to look at fossil insects.  I've seen some pre-press examples of the same technology on extant bugs, the results are incredible, in part because all the soft tissues are left intact and you can get any cross section you want.  I'm really curious as to 1) the the resolution on the fossils, on extant critters it is apparently very close to SEM calibre; and 2) the cost.  This could be a huge boon to making fossil data available to phylogenetic studies.  The kicker at the end- the researchers suggest that the printed plastic insect could be designated as a type specimen.  In many ways this would have huge advantages, as anybody with the technology could print their own types. The real suggestion underlying this is not that the plastic itself is the type, but rather that a set of 1s and 0s can be typified, the plastic of course being generated from digital data. This of course opens up a whole can of (prehistoric?) worms.  How many 1s an 0s are needed before a (meaningful) type can be designated?  Can I use a CoolPix for imaging and maybe bundle my images with some 1s and 0s that encode for some specimen data, and typify the resulting zip file? From a pragmatic standpoint- why not?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-2450336426732493624?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/2450336426732493624/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=2450336426732493624' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/2450336426732493624'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/2450336426732493624'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2008/04/plasto-types.html' title='Plasto-types?'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-2833160281872198881</id><published>2008-02-29T19:10:00.003-05:00</published><updated>2008-02-29T20:03:06.700-05:00</updated><title type='text'>barcoding (dna) Google tech talk</title><content type='html'>Haven't seen this pointed at yet, a little state of the union from Hebert and Janzen that was posted on Google's tech talks. "It [barcoding] works with startling clarity." ;).  A number of interesting insights during the question period, with a general focus on scale, diminishing returns, and legal issues (curious that, coming from Google and all). &lt;br /&gt;&lt;br /&gt;&lt;object width="425" height="355"&gt;&lt;param name="movie" value="http://www.youtube.com/v/eGYAMDGMraA"&gt;&lt;/param&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/param&gt;&lt;embed src="http://www.youtube.com/v/eGYAMDGMraA" type="application/x-shockwave-flash" wmode="transparent" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-2833160281872198881?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/2833160281872198881/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=2833160281872198881' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/2833160281872198881'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/2833160281872198881'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2008/02/barcoding-dna-google-tech-talk.html' title='barcoding (dna) Google tech talk'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-8846217887614657895</id><published>2008-01-27T15:27:00.000-05:00</published><updated>2008-01-27T22:03:50.621-05:00</updated><title type='text'>misc.</title><content type='html'>Annotating images with user-created overlays is a must for defining morphological characters, or enhancing ontologies.  This seems to be a relatively tricky thing to do over the web.  &lt;a href="http://www.mainada.net/inputdraw%20"&gt; Inputdraw &lt;/a&gt; is a SWF widget that is free for noncommercial use.  It allows you to save overlays drawn on images into forms, the data are then saved as SVG text.   The &lt;a href="http://opencollection.org/"&gt;OpenCollections&lt;/a&gt; software also mentions an annotation system that is in the works, though the bottom line there is that its not quite ready prime time.  At a recent &lt;a href="http://morphbank.net/"&gt;Morphbank&lt;/a&gt; meeting someone (apologies for not remembering who) mentioned that another solution might involve using the &lt;a&gt;Google Maps API&lt;/a&gt; to create polylines or points on custom "maps", which would in fact be your images.   The &lt;a href="http://googlemapsbook.com/"&gt;Beginning Google Maps Applications with Rails and Ajax&lt;/a&gt; book is a decent starting point for implementing this approach.  Note that you would need .gif or .png formatted images if you attempt this. I used the Beginning Google Maps book to finally implement maps "natively" within mx (we still have hooks to &lt;a href="http://berkeleymapper.berkeley.edu/"&gt;BerkeleyMapper&lt;/a&gt;, which is a great service), but not without several hours of frustration.  While the book is quite clearly written the code provided in Chapter 3 is incomplete or erroneous in several very frustrating ways, so make sure to download the updated code from the website if you have the book.&lt;br /&gt;&lt;br /&gt;As I add new tables to mx  I'm starting to add fewer and fewer columns, with the idea that tagging can be used as the primary means of extending the basic objects.  Tagging essentially allows you to extend your records to as many fields as you want, and is therefor a very simple way to provide &lt;a href="http://en.wikipedia.org/wiki/Extensibility"&gt; extensibility&lt;/a&gt;.  A &lt;a href="boohttp://atomiq.org/archives/2008/01/tagging_the_book.html"&gt;new book on the tagging phenomenon&lt;/a&gt; by Gene Smith looks to be a must read, I think I'll order mine now.&lt;br /&gt;&lt;br /&gt;Finally, while somewhat older news, I keep thinking about  social annotation with respect to taxonomy, and also things like scoring phylogenetic characters.  How might things discussed in  &lt;a href="http://video.google.com/videoplay?docid=-8246463980976635143"&gt;this Google talk from Luis von Ahn &lt;/a&gt; be applied to systematics research? Within taxonomy one approach could be to simply photograph many specimens and then allow the general public to point out the similarities and difference.  These could then be vetted by the "experts" as a starting point. This approach, albeit with a greatly simplified "taxonomy" and character set, is being used at &lt;a href="http://galaxyzoo.org/"&gt;GalaxyZoo&lt;/a&gt;. In the GalaxyZoo example the galaxies have already been classified by computer algorithms into various types, so there is an excellent comparative dataset for testing things like the trustworthiness of the public contributions.  &lt;a href="http://www.peekaboom.org/"&gt;Games&lt;/a&gt; like those discussed by von Ahn could also be used in developing hypotheses of character homology.  In systematics we present homology hypotheses that are then further tested using phylogenetic analysis.  What if part of this testing requires that the definition of these hypotheses be agreed upon by two or more experts, using games like those discussed by von Ahn?  In theory this agreement is already required, to some degree, as implemented in the review process that occurs prior to publication.  It could, however, be made more explicit (and fun?). Given the right framework for playing these games there would be many beneficial spin offs including obvious things like annotated ontologies of morphological characters.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-8846217887614657895?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/8846217887614657895/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=8846217887614657895' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/8846217887614657895'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/8846217887614657895'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2008/01/misc.html' title='misc.'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-1522845098374004349</id><published>2008-01-13T19:03:00.000-05:00</published><updated>2008-01-13T19:21:06.628-05:00</updated><title type='text'>FigTree v1.1</title><content type='html'>Andrew Rambaut has posted a message to the beast-users listserv, he's just released a &lt;a href="http://tree.bio.ed.ac.uk/software/figtree/"&gt;new version of FigTree&lt;/a&gt;.  Among other things this new version allows you to re-root trees (something I bugged him for a while back, as I'm sure others did as well).  Perhaps even better, the &lt;a href="http://http//figtree.googlecode.com/"&gt;source code has been released&lt;/a&gt;.  With these new features, and now that others can conceivably add new bells and whistles, I suspect FigTree will remain the premier tree rendering software for some time.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-1522845098374004349?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/1522845098374004349/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=1522845098374004349' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/1522845098374004349'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/1522845098374004349'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2008/01/figtree-v11.html' title='FigTree v1.1'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-8324402782814467344</id><published>2007-12-22T11:48:00.000-05:00</published><updated>2007-12-22T12:15:28.042-05:00</updated><title type='text'>T(ree analysis using)N(ew)T(echnology) is free</title><content type='html'>&lt;a href="http://blindscientist.genedrift.org/2007/12/19/a-bit-of-tnt-in-your-morning-coffee/"&gt;This post&lt;/a&gt; at &lt;a href="http://blindscientist.genedrift.org/2007/12/19/a-bit-of-tnt-in-your-morning-coffee/"&gt;Blind.Scientist&lt;/a&gt; alerted me to the fact that &lt;a href="http://www.zmuc.dk/public/phylogeny/tnt/"&gt;TNT&lt;/a&gt; is now free, thanks to support from the Willi Hennig society.    That's worth repeating here.  The Blind.Scientist blog mentions a review is in the works, so check back there shortly. This is a great step forward as TNT has a lot of promise, particularly with respect to its scripting capabilities.  Much of that promise is locked up in, if I dare say, its somewhat cryptic help documentation, and buggy script examples.  I've worked a lot with TNT in the past, but haven't checked the latest documentation, so take this observation with a grain of salt (a small one).  I've mentioned to Pablo, and have heard others comment, that what is needed is a community wiki similar to that supporting &lt;a href="http://beast.bio.ed.ac.uk/"&gt;Beast&lt;/a&gt; (a wonderfully supported piece of software, see the active &lt;a href="http://beast.bio.ed.ac.uk/#Mailing_List"&gt;mailing-list&lt;/a&gt; for example).  This suggestion has thus far fallen on deaf ears, but maybe now that the software is more accessible (somewhat, it still isn't open source, which is a major strike against it IMO), something will happen. As a side note- I've had problems running the latest TNT on OSX 10.3.9, and Pablo wasn't able to get me a compiled copy that worked.  I'd be curious if anybody has similar problems, or whether its just my old PowerBook giving up the ghost.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-8324402782814467344?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/8324402782814467344/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=8324402782814467344' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/8324402782814467344'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/8324402782814467344'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2007/12/tree-analysis-usingnewtechnology-is.html' title='T(ree analysis using)N(ew)T(echnology) is free'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-9166681019066759333</id><published>2007-12-14T22:17:00.000-05:00</published><updated>2007-12-14T22:31:42.374-05:00</updated><title type='text'>pixels to Newick</title><content type='html'>Bioinformatics sports a &lt;a href="http://bioinformatics.oxfordjournals.org/cgi/content/short/23/24/3384"&gt;new paper&lt;/a&gt; on an application called &lt;a href="http://www.cibiv.at/software/treesnatcher/"&gt;TreeSnatcher&lt;/a&gt;.  The meat of the problem is essentially recognizing and converting a tree in a picture (pixels) to a representation in Newick format.  While nifty I'm not sure this is particularly useful, and perhaps its a reflection of the lack of awareness of sites like &lt;a href="http://www.treebase.org/treebase/index.html"&gt;TreeBase&lt;/a&gt;.  Older trees are relatively small, and can be rebuilt using Mesquite or other programs.  New trees should of course be archived somewhere so that they can be grabbed auto-magically, or, as my shadowy friends put it, "just ask the $%@* author for the trees". Fire up those screen-caps!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-9166681019066759333?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/9166681019066759333/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=9166681019066759333' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/9166681019066759333'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/9166681019066759333'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2007/12/pixels-to-newick.html' title='pixels to Newick'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-7265310092619419930</id><published>2007-11-26T00:58:00.001-05:00</published><updated>2007-11-26T01:22:45.731-05:00</updated><title type='text'>Not practical, but pretty.</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_nUND2t4xLRE/R0phJo4JSgI/AAAAAAAAAAM/Gb19gTLkH0Y/s1600-h/ns_tree.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://4.bp.blogspot.com/_nUND2t4xLRE/R0phJo4JSgI/AAAAAAAAAAM/Gb19gTLkH0Y/s320/ns_tree.jpg" alt="" id="BLOGGER_PHOTO_ID_5137025143189096962" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;A nested set representation of a Newick tree loaded into &lt;a hfre="http://hymenoptera.tamu.edu/wiki/"&gt;mx&lt;/a&gt; (source in 0.1.1379).  Even though this isn't the best way to visualize a phylogenetic tree I was happy to just get this far.   Now to see if trees can be drawn using only margins of divs and position:relative, with maybe a little opacity: for good measure.  Don't hold your breath on that one.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-7265310092619419930?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/7265310092619419930/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=7265310092619419930' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/7265310092619419930'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/7265310092619419930'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2007/11/not-practical-but-pretty.html' title='Not practical, but pretty.'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_nUND2t4xLRE/R0phJo4JSgI/AAAAAAAAAAM/Gb19gTLkH0Y/s72-c/ns_tree.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-3740652238683032923</id><published>2007-11-23T22:35:00.000-05:00</published><updated>2007-11-23T23:30:11.157-05:00</updated><title type='text'>storing phylogenetic trees in mx</title><content type='html'>Turkey in belly I've sat down to flesh out the storage of phylogenetic trees in mx.   Krishna's phylo_tree lexer/parser creates nice Ruby objects of &lt;a href="http://www.molecularevolution.org/resources/fileformats/tree_formats.php"&gt;newick&lt;/a&gt; formatted trees, all I had to do was incorporate another of his efforts &lt;a href="http://opensource.symetrie.com/trac/better_nested_set/"&gt;better_nested_set&lt;/a&gt;, to translate the parsed tree to a database table, all in all just a couple lines of new code.  Visualization is another can of worms.  I think I'll likely wait till a nice Rails/Ruby graph (as in trees) generator comes out before attempting this.  As a thought experiment its interesting to think about how to display a tree completely in ascii, and not just drawing the branches with _, -, and |, but rather displaying groups of terminals, or labeled internal nodes in some meaningful manner depending on where you are in the tree.  Its late, so I envision starting with something like "you are in twisty set of passages all alike" ...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-3740652238683032923?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/3740652238683032923/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=3740652238683032923' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/3740652238683032923'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/3740652238683032923'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2007/11/mx-trees-and-pygmybrowser.html' title='storing phylogenetic trees in mx'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-3963462742602413951</id><published>2007-11-21T15:53:00.000-05:00</published><updated>2007-11-21T16:05:35.070-05:00</updated><title type='text'>Morphbank updated, now with matrices</title><content type='html'>&lt;a href="http://morphbank.net/"&gt;Morphbank&lt;/a&gt; has been updated with new features including the ability to create and score OTUs.  They will also be rolling out some new searches that return RDF and XML in the very near future, and functionality for allowing external clients to upload images.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-3963462742602413951?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/3963462742602413951/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=3963462742602413951' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/3963462742602413951'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/3963462742602413951'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2007/11/morphbank-updated-now-with-matrices.html' title='Morphbank updated, now with matrices'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-7160431991055366477</id><published>2007-11-07T17:57:00.000-05:00</published><updated>2007-11-07T18:09:35.163-05:00</updated><title type='text'>relation_browser plugin for Rails</title><content type='html'>The &lt;a href="http://dre.gs/2007/8/14/relation_browser-0-1"&gt;relation_browser&lt;/a&gt; plugin is a neat start at using a SWF plugin written by &lt;a href="http://der-mo.net/"&gt; Moritz Stefaner&lt;/a&gt; in a &lt;a href="http://rubyonrails.com"&gt;Rails environment&lt;/a&gt;.  I've implemented it in a crude manner in &lt;a href="http://hymenoptera.tamu.edu/wiki"&gt;mx&lt;/a&gt; as a taxon name hierarchy browser.  I'd like to get it to work with the simple ontology editor/displayer that's also in mx, though I'll likely wait to attempt this untill the plugin is a little more mature.  Every once in a while its definitely worthwhile to check out Agile's catalog of &lt;a href="http://agilewebdevelopment.com/plugins"&gt;Rail's plugins&lt;/a&gt; for gems like this.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-7160431991055366477?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/7160431991055366477/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=7160431991055366477' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/7160431991055366477'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/7160431991055366477'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2007/11/relationbrowser-plugin-for-rails.html' title='relation_browser plugin for Rails'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7306726567188713627.post-1057482075759058619</id><published>2007-11-06T21:46:00.000-05:00</published><updated>2007-11-06T21:58:26.947-05:00</updated><title type='text'>Flare</title><content type='html'>First post, maybe last.  The open source visualization toolkit &lt;a href="http://flare.prefuse.org/"&gt;Flare&lt;/a&gt; appears at first glance (and second) to have some potential as tool for rendering phylogenetic trees.  A quick look at the &lt;a href="http://flare.prefuse.org/demo/"&gt;demo reel&lt;/a&gt; should convince you of this, click on the "GraphView" button therein to jump to the trees bit.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7306726567188713627-1057482075759058619?l=cyphy.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cyphy.blogspot.com/feeds/1057482075759058619/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7306726567188713627&amp;postID=1057482075759058619' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/1057482075759058619'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7306726567188713627/posts/default/1057482075759058619'/><link rel='alternate' type='text/html' href='http://cyphy.blogspot.com/2007/11/flare.html' title='Flare'/><author><name>Matt</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
