I just had a 4-day weekend, in which I got little more accomplished than a usual 2-day weekend.
Much of Saturday was spent trying to use PacBio reads to improve a draft genome of a V. cholerae strain that I had built with 454 reads a couple of years ago. There was no problem getting blasr to map the reads, and I could call variants with “samtools mpileup”, though that took 2 CPU days to complete. Unfortunately, that did not tell me what I really needed to know, which was whether the orignal assembly was in the right order. I found a couple of places where the PacBio read mapping indicated problems (either the reads all terminated their mapping at nearly the same point, or they suddenly switched from aligning very well to aligning poorly). Unfortunately, I’ve not yet figured out a good way to automate this detection, so I’m not sure I can find all the places which might have problems. Dips in average quality of the mpileup consensus over 50- or 100-base windows pulls out the places where the alignments get bad, but not where they suddenly stop. Furthermore, once I’ve identified the bad regions, I still need to break the genome apart there, rebuild the bad regions from the PacBio reads that map nearby, and see if I can stitch the genome back together (probably with extra repeats that had not been resolved in the 454 assembly). I’m considering backing off and building a new genome assembly from just the PacBio reads (after cleaning them up using PacBio2CA and the 454 reads) and the Celera Assembler. I can then compare the genome built from the PacBio reads and the one built from the 454 reads and resolve any discrepancies. Sigh, this project keeps getting bigger, just as I think I’m almost done.
On Sunday, I did a bunch of small tasks: raked leaves and shredded them, updated the grad alumni web page, announced the Freshman Design Seminar class (which will happen winter quarter, though once again as a “Group Tutorial” to prototype the course before submitting the official paperwork), wrote a letter of recommendation for a student applying to grad schools, scanned in the flyer for “Planet of the Abes” (the recent Dinosaur Prom show), scanned 35-year-old t-shirt of mine so that I can get another copy made, updated my paper list to include the just released PNAS paper, wrote a blog post, and caught up with a lot of my e-mail (though there are still some advising e-mails that I haven’t taken care of).
On Monday, I did a lot of grading, wrote another blog post, used the Planet of the Abes flyer to make titles for the Dinosaur Prom video, and rendered the video (tying up my laptop all night). I also cleaned up the scan of the old t-shirt and converted it to SVG so that a new silk screen printing can be done. I’ve tried looking for the copyright holder for the design, but I have no idea how to find him (or her)—Google image searches bring up nothing similar, and there is no signature on the design or the shirt. I started working on my slides for the talk I have to give on Thursday, but did not get much done.
This morning, I responded to more e-mail, wrote another blog post, did more grading, and returned the activity monitor I’ve been wearing for the past 2 weeks to the Sleep Center. In the afternoon, I did more grading at Gayle’s Bakery in Capitola, met with my son’s consultant teacher for a couple of hours, bought the usual weekly load of soy milk (only 2.5 gallons this week), did some other grocery shopping, finished the grading, recorded the grades, cleared the rest of the advising e-mail, and compared results on group theory problems with my son. We’re a bit behind schedule there—he’s not finished all the Chapter 1 and Chapter 2 problems I assigned, and we’re supposed to be finishing Chapter 3 this week—I’ve not even assigned Chapter 3 problems yet.
Things I wanted to do this weekend but didn’t:
- Get the slides done for Thursday’s talk
- Get the Program Learning Objectives written for bioinformatics
- Get assessment plans defined and written for the Program Learning Objectives for both bioengineering and bioinformatics.
- Create a draft of a revised curriculum for the third track in bioengineering (which also needs a new name and a clearer focus).
- Rewrite the handout for the next programming assignment in the Bioinformatics: Models and Algorithms courss.
- Write code for looking for regions of the Helicobacter pylori genome that are possibly swapped in the current assembly and test for which rearrangement is most consistent with our data.
- Start testing the BitScope differential input device they sent me.
- Start working on Chapter 3 problems in group theory.
- Start writing a paper on the segmenter that I described in my blog 3 months ago.
- Clear the leaves off the roof before the rains start, since the leaves form dams that keep the rain from running off into the gutters properly.
There were probably other things, but I forget what they were now. Once the to-do list gets longer than my piece of paper can hold, things fall off it.