Tip of the Week: Word Add-In for Ontology Recognition

17 March, 2010 (08:07) | Tip of the Week | By: Jennifer

In today’s tip I want to make you aware of a tool that I think will help researchers to present their own data and publications in an accurate and universally searchable way. I learned of the resource (UCSDBioLit) through an article in one of my recent BioMed Central article alert emails. This resource allows authors to mark-up their own publications with XML tags AS THEY WRITE their papers. This will allow faster and more accurate semantic searching of their research.

A huge problem in science today is the ability to quickly search the vast literature base and to accurately and efficiently find the data that you are interested in. Here at OpenHelix we focus on ways of effectively and efficiently get information out of public databases and resources, but at the other end of the process is the ability for scientific knowledge to be curated into those resources. We have featured biocurators and the phenomenal work that they do several times in the past, but it is work that never ends and can be very labor intensive. It often involves an initial triaging of a field’s literature, some level of automatic information gathering, and then careful manual effort on the part of scientist at the resource to gather and present the information through their site. I know from personal experience that the process of reading a paper, clarifying research details with an author, and then presenting that information to the author’s satisfaction can be a very long & labor intensive process, for both the curator AND the original author.

For years there has been discussion of ‘expert curation’ in which experts in the field author review or summary pages in a resource, or community curation jamborees, etc. And there have been fruits from many of these efforts, but in general participation is low. But who is more of an expert on the research being published other than the author himself? If authors could/would mark up their own papers during the publication process, not only could they be assured that it would be accurate but they would help make their research universally searchable without the lag required for searchability through a specific resource. Thus far document mark-up is has not been an easy process and has largely been deemed ‘not worth the effort’ for the level of attribution/recognition affiliated with it.

The BioMed Central article does a nice job of outlining and discussing many of these issues. It cites many other efforts and resources, explains their motivation and the implementation of their software. A nice feature of the tool is that there are interoperability features, and a real commitment to conforming with existing standards of practice. The article also presents an appendix of resource addresses of other groups involved in semantic searching and literature publication. I especially like this quote from the paper:

The Word add-in presented here will assist authors in this effort using community standards and by making it possible for the author of the document, the absolute expert on the content, to do so during the authoring process and to provide this information in the original source document.

You can also find brief tutorials on using the tool at SciVee: Word Add-in for Ontology Recognition Tutorial (1 of 4): Install Process

As a note, literature mark-up and enabling are currently an active area – Mary found another literature handling resource and paper as well: Check out the tip, the articles & the tools. Tell me what you find/think. Thanks! (OH, and Happy St. Patty’s to ya!)

UCSDBioLit Reference:
ResearchBlogging.org
Fink, J., Fernicola, P., Chandran, R., Parastatidis, S., Wade, A., Naim, O., Quinn, G., & Bourne, P. (2010). Word add-in for ontology recognition: semantic enrichment of scientific literature BMC Bioinformatics, 11 (1) DOI: 10.1186/1471-2105-11-103

Comments: -

Another day, another genome: Hydra

16 March, 2010 (08:09) | Genomics Research | By: Mary

Ok, I had no idea hydra was even in the pipeline.  I’m a fan of hyda from the days way back in the developmental biology lab.  We were assaulting hydra in various ways and watching the regeneration ensue.  It was very compelling to see how they reacted to removed tentacles, split body axes, and such.

So when I saw the news about this paper in Nature I went over for a look: The dynamic genome of Hydra is an early look at the genome–the assembly isn’t complete.  But it still has some useful tidbits.  About 20,000 genes at this point, some are clearly in known pathways such as Hox members and Wnt signalling genes.

The best chuckle I got out of the paper, though, is the transposable elements.  I still haven’t recovered from the corn TEs (which make up more than 80% of corn), and here we go again.

Transposable elements make up ~57% of the Hydra genome and represent over 500 different families….

I’m beginning to think that all of this biology stuff is just a way to disperse transposable elements.  Who knew?

Comments: -

FYI: NCBI Has Officially Announced New dbVar Resource

15 March, 2010 (15:55) | Genomics News, Genomics Resource News, New Resource | By: Jennifer

As I was browsing over NCBI’s homepage, I happened to notice an announcement dated March 2nd that stated that the dbVar resource that Mary mentioned briefly in a weekly tip a while back is now publicly available. Here’s the brief announcement:

Tuesday, March 02, 2010, 1:00:00 PM
NCBI’s new database of Genomic Structural Variation (dbVar) archives large scale genomic variation data as well as associations of defined variants with phenotypic information.

You can find this announcement and others here.

From the dbVar documentation, it looks like it is mostly in ‘collection mode’ at the moment with lots and lots of data being added, FAQs on how to submit to dbVar, and some background information on what structural variation is, and how it is detected. It looks like the actual graphical displays of the variations use NCBI’s Sequence Viewer. It will be interesting to see how this new NCBI resource grows and is utilized.

edit: 3/16 9am – links to dbVar all appear to be down today. We have an email in to NCBI & will keep you posted on anything that we hear from them.

edit 2: 3/16 1pm – The links to dbVar are working for me now. Thanks, NCBI, for the quick fix!

Comments: 2

Genetic Dieting

12 March, 2010 (16:09) | General Science, Genomics News | By: Trey

Just some light Friday reading. I’ve seen ads and articles about “body type diets” and diets supposedly based on our genetics, “Genotype Diet.”  It has interested me. I gained 80lbs over the last 10 years and spend most of that time trying different methods (low carb, etc, etc) to lose that. Finally lost 60 using that old-fashioned way, Weight Watchers :D . And though I’m sure there are differences in how we individually eat, crave and put on/take off weight based on our genotypes, I’ve never seen solid scientific evidence that we can determine what those genotypes are in such a way that we can figure out the ‘best’ diet for each of us. Frankly, I’ve come to the conclusion the “best” diet for all of is to each whole fresh food, less of it and lots of plants (where have I read that?), so I tend to believe the complication of trying to figure out what our ‘genotype diet’ might be is wasted effort for the most part (except when it comes to things like gluten intolerance, etc. of course) and that most of these ‘diets’ are, to put it in scientific terms, hokum.

Yet, a new study was just published that suggests that our genes do indeed (as our anecdotal experiences suggest) have an effect on how we respond to different ways of eating. Yet, the study is a small sample (58 women) and there are some other caveats (like the sponsor of the study does a genetic test to determine the best diet :/), but it does look interesting: Genetically Determined Dieting? Maybe Not Yet – Shots – Health News Blog : NPR.

I’ll wait till more extensive, conclusive study comes out to say what effects our genes have on what diet we respond to best, but frankly, most of us would probably do pretty well eating a mainly a lot of of whole fruit, vegetables, nuts, legumes and grains with a bit of lean meat and dairy (if we are so inclined). Very simple (not easy, but quite simple).

Comments: -

What’s your problem? Open Thread

11 March, 2010 (08:14) | What's Your Problem? | By: Mary

wyp_q_mark2_thumbnail1Welcome to the “What’s Your Problem?” (WYP) open thread. The purpose of this entry is to allow the community to ask questions on the use of genomics resources. Think of us as a virtual help desk. If you have a question about how to access a certain kind of data, or how to use a database, or what kind of resources there are for your particular research problem, just ask in the comments. OpenHelix staff will keep watch on the comment threads and answer those questions to the best of our knowledge. Additionally, we encourage readers to answer questions in the comments too. If you know the answer to another reader’s question, please chime in! The “WYP” thread will be posted every Thursday and remain at the top of the blog for 24 hours. Questions or problems asked on Thursday will be answered on Thursday to the best of our ability. You can leave questions on other days of the week, but the answer might not come that day.

We’d also like to invite resource providers to let us know if they have something new to talk about, or something they want to mention to the bioinformatics community. We’ve had some people email us because they weren’t sure if they should post something, and we want to say that’s fine.

So What’s Your Problem? And What’s Your Solution? :)

You can keep up with this thread by remembering to check back, by subscribing to the RSS comments feed to this WYP post or by subscribing to be notified by email of new comments to the post (use checkbox at end of comment form, you can unsubscribe later). If you want to be notified of future WYP posts (every Thursday), you can subscribe to the WYP feed.

Comments: -

Come to the carnival….

11 March, 2010 (08:35) | Genomics Resource News | By: Mary

There’s a new carnival in town.  It was all the buzz on the #bioinformatics twitter feed the last couple of days.  Byte Size Biology delivers the first edition of:

Bioinformatics Blog Carnival #1

For those of you who aren’t as immersed in science blogginess as others, a carnival is a collection of blog posts usually around a given theme or goal.  They are nice–they generate networks in the community, and you might find blogs and bloggers that you wouldn’t have been aware of otherwise.  In this case there is quite a range of topics, maybe there will be something you’ll connect with.  Go check it out!

Bioinformatics Blog Carnival #1

I would have submitted something if I had heard about it in time, but I was away. Alas. There’s always next time. But I can’t fingure out when next time is yet…

Comments: -

Tip of the Week: HapMap data in Haploview

10 March, 2010 (00:01) | Tip of the Week | By: Trey

HapMap has had a few minor updates to their browser, and importantly, new phase 3 data was released early last year (drafts of that data were released in 2008). Haploview, the downloaded software that allows the user to perform in depth LD and haplotype analysis, has been recently updated from version 4.1 to version 4.2. Haploview can be used with user data or data downloaded from the HapMap project. Though, version 4.1 did not work for phase III HapMap project data, so the user had to use phase I and II data if they wanted to use version 4.1. Haploview has now been updated to version 4.2, allowing the user to use HapMap phase III data.

That’s a lot of versions and phases :) . The short of it is, if you use Haploview 4.2, you can view and analyze data from any phase of the HapMap project.

Today’s tip briefly shows you how to download data from the HapMap project and view it in Haploview.

Comments: -

What would you do with your genome?

9 March, 2010 (10:50) | General Science, Genomics Research | By: Mary

So last week I treated myself to my first vacation in a long time.  It was my birthday, and I wanted to disconnect a bit and recharge.  Mostly it worked, although the hundreds of emails I’m facing this morning are a bit daunting.  But just before I left I got an email from a colleague who asked me a really great question:

….I would love to know where you would start when you get back a personal genome sequence….

And I couldn’t shake this out of my head.  I was sitting on a bridge outside Windsor Castle thinking about it as the sun set on my first day.  (On subsequent days I found that the far superior ciders in the UK were able to push this question out of my head for some periods of time. And also pie.)

I’ve spent some significant time thinking about the onslaught of personal genomics, of course.  It’s all been very theoretical, because I would have refused to even begin the process of obtaining my personal genome sequence until the GINA legislation fully kicked in.  But now that barrier is down.  I’m still not ready to get mine done for a variety of reasons (cost, quality, informative value).  But it’s still worth thinking about what I would do with it if it was handed to me–in specific terms, with concrete actions.  So here’s what I decided I would do.  Your mileage may vary.  And I’d love to hear what others might do with theirs.  Follow the link for the specific actions I’d take.

Click to continue reading “What would you do with your genome?”

Comments: 8

MyNCBI and eRA Commons

8 March, 2010 (17:22) | General Science, Genomics Research | By: Trey

If you use MyNCBI (which I like) and eRA Commons (got an NIH grant? Then you probably do), MyNCBI now has a handy new feature I just discovered (of course if I looked at the NCBI news last month, I’d have discovered it earlier :D ) and thought I’d point out.
The new feature allows you to link your bibliography in MyNCBI (which is absurdly easy to build :) with your eRA commons account. Then, when you view your bibliography, you’ll find a new $ icon. Click that icon and it will give you the following information about all your citations: Whether the citation complies with NIH’s (sort of) new public access policy and if there is NIH funding associated with the citation (which you can link if you have an open grant in your eRA Commons account). Nice feature you might want to check out, and it might be a nice subject for a Tip of the Week in a couple weeks :) .

Comments: -

Mary’s post is ResearchBlogging “Editor’s Selection”

5 March, 2010 (16:50) | General Science | By: Trey

This post was chosen as an Editor's Selection for ResearchBlogging.orgJust wanted to point out that our fellow blogger Mary had an excellent post the other day entitled “What if Bt saved human lives?” over at Biofortified, which she mentioned here earlier.

Well, it was chosen at “Research Blogging” as an editor’s selection this week. Check it out when you’ve got a moment.

Comments: -
WordPress Loves AJAX