ICBO 2014 Report, Day 1

ICBO has gotten off to a good start with a tutorial and three workshops. I mostly spent the day in the workshops, but the tutorial (OBO Tutorial: Getting Things Done with Open Biological and Biomedical Ontologies) introduces a number of interesting new tools, including Ontofox, Ontodog, and Ontorat. These complement the linked data ontology browsing tool Ontobee. It also included an introduction to RDF and SPARQL that looks like it was very helpful.

The <a href="http://icbo14.com/sessions/drug-drug-interaction-knowledge-representation-workshop/"Drug-Drug Interaction Knowledge Representation Workshop had a really eye-opening keynote about Drug-Drug Interactions (DDIs) by Dan Malone. He showed how there is little agreement among existing DDI authorities, because the standards of evidence are very low. According to him, most major drug interactions are supported by nothing more than case studies.  Many of these interactions persist in these resources mostly because of liability issues – if an interaction is pulled and a patient later suffers (or appears to suffer) from it, then the authorities might be held liable for withholding that information. He also showed some preliminary results that clinicians want to be presented with alternatives to and explanations of the DDIs they are warned about.

I also attended a morning workshop on developing a biobanking ontology using the Ontology for Biomedical Investigations. I think this sort of common vocabulary could have a similar benefit to what schema.org has made it easier to index structured information from web pages. Frank Manion also presented the start of an Informed Consent Ontology, which currently represents informed consent documents. Some notable thoughts and/or quotes:

  • “In data sharing, you’re sharing data with yourself in two years” – Dave Parrish
  • Ontology theory and development would be a really useful undergraduate elective course. – Penn Medicine Biobank team

I’ll check in again tomorrow!

4 Reasons Why Semantics Help Make Biobanks Better

My first blog post at 5AM is up:

The Semantic Web provides a means to link information on the web to each other and to things in real life in an interoperable way. Internationalized Resource Identifiers, of which URLs are a type, are used to identify nearly everything, and linked data makes it possible to visit those URLs to get more information about the things they represent. This has some very useful applications, especially in biobanking. Semantics was literally made for biomedical research, and here are 4 ways in which that relationship can help make biobanks better information resources…

Read more at http://info.5amsolutions.com/blog/bid/152921/4-Reasons-Why-Semantics-Help-Make-Biobanks-Better.

Validating RDF

I spent the day listening in on the RDF Validation Workshop, which kind of spilled over into the Cambridge Semantic Web Meetup. Here are my general musings and notes from the day. They may be completely wrong for many reasons, including possible misunderstandings of what the speaker said and information that is now out of date.

Google Says They’re Triplifying the Web

The Google Knowledge Graph is including a need to support users in how they can provide rich snippets in Google search results. They are building a validator for these formats against the RDF representations of their microdata. Most of their constraints are property paths, and they use SPARQL for the rest. They are also mostly concerned with suitability to their purpose, which is based on rich snippets and knowledge graph. They are using SPARQL-based constraints and are using RDFlib for prototyping, but will be moving to their own parser, which is used by the Structured Data Testing Tool. Here is an example path constraint, with the resulting SPARQL queries that are generated from it:

SELECT ?context WHERE {?context schema:flightNumber ?constraint.}
ASK WHERE {?context schema:flightNumber ?constraint.}

Currently, they are only validating things that are necessary, they won't check for things that are optional.

Semantic Web Meetup

The general idea seems to be that the RDF community needs to provide a means to say the following things about RDF graphs:

  • The graph must at least contain X.
  • The graph must contain at most Y.
  • The graph can never contain Z.

The general idea seems to be to provisionally close any given RDF graph before validation in order to produce the report. That closure can include some fixed set of other graphs (such as vocabularies used), but ultimately, for the purposes of validation, the Unique Name Assumption and the Closed World Assumption need to be used to validate the graph as given. Eric Prud'hommeaux presented an interesting framework based on YACC-style grammars by providing "shapes" of objects to validate. This is similar to OSLC's (Open Services for Lifecycle Collaboration) Resource Shape vocabulary, but with additional capabilities around disjunction and non-declarative validation processes.

Citing Your Sources on the Web

I was involved in the World Wide Web Consortium (W3C) Provenance Working Group, which was an amazing experience, even though I couldn’t put as much time into it as I would have liked. My friend and collaborator, Tim Lebo, edited the Provenance Ontology (PROV-O). PROV-O is, in my narrow perspective of the world, a fantastic foundation for talking about how stuff happens and, most importantly to this post, how to cite people and resources on the web.

Continue reading

Getting Up Early in the Morning

Well, sort of. I have some exciting news: in August I will be starting at 5AM Solutions as a data scientist. I’ll be finishing my time at Yale University with Michael Krauthammer, and will soon be wrapping up my computer science Ph.D. at Rensselaer Polytechnic Institute in the Tetherless World Constellation. Continue reading

Thanksgiving Science!

I’ve got a little formula that predicts how long it will take for our Thanksgiving turkey to cook. It works really well for our temperatures and preparation, but I’d like to make it a little more general so everyone else can use it, regardless of temperature. As a wise man once said, if it’s worth doing, it’s worth overdoing, unless you’re overcooking turkey.

Towards that end, and because this is a science blog, I would like to perform a hypothesis-generating experiment. If you’re willing to further science, please share some details about how you prepared your turkey, and how it turned out. Humanity will thank you. Turkeys will not. I will post the results when I can, and maybe we can try again next year for a full prediction.

Click here to share your turkey data.

How to fix the economy

  1. I’m not an economist, so this probably has errors in it.
  2. None of these ideas are mine. I’ll hunt down where I got them from as soon as I can. See citations where I put them in.

Businesses aren’t investing in jobs because they have no reason to. They sit on huge piles of cash and continue to make acceptable profits without adding any jobs. Astonishingly, we manage to be in a period when inflation is nil, but prices are still going up. You know what gets people investing in new things? Inflation. Mild-to-moderate inflation drives capital into investments because otherwise the value of that money decreases over time.[1]

How would someone in charge of, say, the Fed, manage to increase inflation? They start by buying debt from banks, which frees up cash in existing banks to make other loans. Home mortgages would be a great start. The Fed can easily start buying up underwater mortgages and forgive the principal above and beyond the current market value. This puts a floor on the housing market and stops the glut of foreclosures, which gives the construction industry something to do again.

I would also start buying up and forgiving student loans of unemployed college grads. This would put them in a position to take greater risks with their careers, either taking jobs that aren’t directly in their fields or giving them room to start up businesses of their own.

This would be hugely controversial, of course, but the Fed chairman is appointed so that he can make these calls. We need something decisive either way, so someone needs to bite the bullet.

But what about everyday prices? Increasing inflation means, in the short term, increases in prices of stuff in general. True, but with the hardest hit people getting loan forgiveness, their overall costs will decrease. Also, investment in new technologies generally decreases real prices even as apparent prices go up. Finally, inflation doesn’t actually impact the poor significantly because they have little savings. In an economy with inflation, wages tend to increase with inflation.

On the fiscal side, there needs to be a resolution to the structural deficit spending. Most of this comes from the Bush tax cuts. Letting them expire takes care of 1/3 of the problem. Another portion of the problem comes from health care costs in medicare and medicaid. These costs can be amortized over a larger population by combining medicare and medicaid into one pool and opening up that pool to everyone. Medicare and Medicaid take care of the most expensive patients in the country, so opening it up to the rest of us would only decrease the overall risk. [2]

Finally, social security can easily be addressed by increasing the witholding limit above 90k. This will put it on sound footing for the next 100 years, based on figures I’ve heard. [2]

  1. Krugman, Paul. Why Is Deflation Bad? http://krugman.blogs.nytimes.com/2010/08/02/why-is-deflation-bad/
  2. Reich, Robert. What If Everyone Saw This Clip Of Robert Reich Exposing 7 GOP Lies? http://front.moveon.org/what-if-everyone-saw-this-clip-of-robert-reich-exposing-7-gop-lies