Murphy_PSLID_BOSC2009

35
PSLID, the Protein Subcellular Loca4on Image Database: Subcellular loca4on assignments, annotated image collec4ons, image analysis tools, and genera4ve models of protein distribu4ons 1 Estelle Glory, Jus.n Newberg, Tao Peng, Ivan Cao‐Berg, and Robert F. Murphy Departments of Biological Sciences, Biomedical Engineering and Machine Learning and
  • date post

    21-Oct-2014
  • Category

    Technology

  • view

    579
  • download

    0

description

 

Transcript of Murphy_PSLID_BOSC2009

Page 1: Murphy_PSLID_BOSC2009

PSLID,theProteinSubcellularLoca4onImageDatabase:

Subcellularloca4onassignments,annotatedimagecollec4ons,image

analysistools,andgenera4vemodelsofproteindistribu4ons

1

EstelleGlory,Jus.nNewberg,TaoPeng,IvanCao‐Berg,andRobertF.Murphy

DepartmentsofBiologicalSciences,BiomedicalEngineeringandMachineLearningand

Page 2: Murphy_PSLID_BOSC2009

Contributors•  MichaelBoland•  MiaMarkey•  GregoryPorreca•  MeelVelliste•  KaiHuang•  XiangChen•  YanhuaHu•  JuchangHua•  TingZhao•  Shann‐ChingChen•  ElviraOsunaHighley•  Jus4nNewberg•  EstelleGlory•  TaoPeng•  LuisCoelho•  IvanCao‐Berg

•  DavidCasasent•  SimonWatkins•  JonJarvik,PeterBerget•  JackRohrer•  TomMitchell•  ChristosFaloutsos•  JelenaKovacevic•  GeoffGordon•  B.S.Manjunath,AmbujSingh•  LesLoew,IonMoraru,JimSchaff•  GustavoRohde•  GhislainBonamy,SumitChanda,

DanRines

Page 3: Murphy_PSLID_BOSC2009

Overview

•  SLIC–  SubcellularLoca.onImageClassifica.on,Clustering,Comparison

•  PUnMix–  SubcellularPaVernUnmixing

•  SLMLTools– Genera.veModelsofCellsandSubcellularOrganelles

•  PSLID–  ProteinSubcellularLoca.onImageDatabase

Page 4: Murphy_PSLID_BOSC2009

TheChallenge

 Comparisonofcellimagespixel‐by‐pixelorregion‐by‐regionmatchingdoesnotworkforcellpaVernsbecausedifferentcellshavedifferentshapes,sizes,orienta4ons

 Organelles/structureswithincellsarenotfoundinfixedloca4ons

 Instead,describeeachimagenumericallyandoperateonthedescriptors(“SLF”‐SubcellularLoca=onFeatures)

Page 5: Murphy_PSLID_BOSC2009

SLICtoolcategories

•  Segmenta.on•  Featurecalcula.on•  Classifica.on•  Clustering•  Comparison

Page 6: Murphy_PSLID_BOSC2009

Featurelevelsandgranularity

Object features

Single Object

Single Cell

Single Field

Cell features

Field features

Granularity: 2D, 3D, 2Dt, 3Dt

Aggregate/averageoperator

Page 7: Murphy_PSLID_BOSC2009

ER

Tubulin DNATfRAc.n

NucleolinMitoLAMP

gpp130gian.n

2DImagesofHeLacells

40

50

60

70

80

90

100

40 50 60 70 80 90 100

Computer Accuracy

Hu

man

Acc

ura

cySubcellularPaVernClassifica.on:

Computervs.Human

EvenbeVerresultsusingmul.resolu.onmethods

EvenbeVerresultsfor3Dimages

Page 8: Murphy_PSLID_BOSC2009

SLICversions–Sourcecode

•  Matlab•  Python•  C++/ITK(subset;fromBadriRoysam’sgroup)

Page 9: Murphy_PSLID_BOSC2009

DecomposingmixturepaVerns

•  Proteinscanbeinmorethanonestructure•  ClusteringorclassifyingwholecellpaVernswillconsidereachcombina.onoftwoormore“basic”paVernsasauniquenewpaVern

•  Desirabletohaveawaytodecomposemixturesinstead

•  Ourapproach:assumethateachbasicpaVernhasarecognizablecombina.onofdifferenttypesofobjects

Page 10: Murphy_PSLID_BOSC2009

PUnMix

•  Learnunmixingmodelinstance•  Unmiximagesusingmodelinstance

Page 11: Murphy_PSLID_BOSC2009

11

ExamplesofObjectTypes

Learnthetypesbyclusteringusingobjectfeatures

Page 12: Murphy_PSLID_BOSC2009

12

34

56

78

Nuclear class

Lysosomal class

Golgi class0

0.1

0.2

0.3

0.4

0.5

Amt fluor.

Object type

12

34

56

78

Nuclear class

Lysosomal class

Golgi class0

0.1

0.2

0.3

0.4

0.5

Amt fluor.

Object type

1 2 34

56

78

Nuclear class

Lysosomal class

Golgi class

All0

0.05

0.1

0.15

0.2

0.25

Amt fluor.

Object type

PureGolgiPaRern

Pure Lysosomal Pattern

Page 13: Murphy_PSLID_BOSC2009

Testsamples

•  HowdowetestasubcellularpaVernunmixingalgorithm?

•  NeedimagesofknownmixturesofpurepaVerns–difficulttoobtain“naturally”

•  Createdtestsetbymixingdifferentpropor.onsoftwoprobesthatlocalizetodifferentcellparts(lysosomesandmitochondria)

Page 14: Murphy_PSLID_BOSC2009

•  Lysotracker

Tao Peng, Ghislain Bonamy, Estelle Glory, Sumit Chanda, Dan Rines (Genome Research Institute of Novartis Foundation)

Page 15: Murphy_PSLID_BOSC2009

•  Mitotracker

Page 16: Murphy_PSLID_BOSC2009

•  MixtureofLysotrackerandMitotracker

Page 17: Murphy_PSLID_BOSC2009

PaVernunmixingresults

17

Page 18: Murphy_PSLID_BOSC2009

PUnMixversions

•  Opensource–MatlabincludingC++•  Compiledversions(notrequiringMatlablicense)forMacOS,Windows,Linux

Page 19: Murphy_PSLID_BOSC2009

SLMLTools‐Genera.vemodelsofsubcellularpaVerns

•  Buildmodelinstancefromimagecollec.on•  Generateimagesfrommodelinstance

•  Viewmul.‐paVernimages

Page 20: Murphy_PSLID_BOSC2009

LAMP2paVern

Nucleus

Cell membrane

Protein

Page 21: Murphy_PSLID_BOSC2009

NuclearShape‐MedialAxisModel

Rotate

Medial axis Represented by two curves

the medial axis width along the medial axis

width

Page 22: Murphy_PSLID_BOSC2009

Synthe.cNuclearShapes

Page 23: Murphy_PSLID_BOSC2009

Withaddednucleartexture

Page 24: Murphy_PSLID_BOSC2009

CellShapeDescrip.on:DistanceRa.o

d1

d2 2

21

dddr +

=

Page 25: Murphy_PSLID_BOSC2009

Genera.on

Page 26: Murphy_PSLID_BOSC2009

Modelsforprotein‐containingobjects

•  MixtureofGaussianobjects

•  Learndistribu.onsfornumberofobjectsandobjectsize

•  Learnprobabilitydensityfunc.onforobjectsrela.vetonucleusandcell

r:normalizeddistance,a:angletomajoraxis

Page 27: Murphy_PSLID_BOSC2009

SynthesizedImages

Lysosomes Endosomes

  HaveXMLdesignforcapturingmodelparameters  Haveportabletoolforgenera.ngimagesfrommodel

SLMLtoolbox‐IvanCao‐Berg,TaoPeng,TingZhao

27

Page 28: Murphy_PSLID_BOSC2009

ModelDistribu.on

•  Genera.vemodelsprovidebeVerwayofdistribu.ngwhatisknownabout“subcellularloca.onfamilies”(orotherimagingresults,suchasillustra.ngchangeduetodrugaddi.on)

•  HaveXMLdesignforcapturingthemodelsfordistribu.on

•  Haveportabletoolforgenera.ngimagesfromthemodel

Page 29: Murphy_PSLID_BOSC2009

CombiningModelsforCellSimula.ons

Protein 1 Cell Shape

Nuclear Model

Protein 2 Cell Shape

Nuclear Model

Protein 3 Cell Shape

Nuclear Model

XML

Simulation

Shared Nuclear and Cell

Shape

Page 30: Murphy_PSLID_BOSC2009

Examplecombina.on

Red=nuclearmembrane,plasmamembraneBlue=GolgiGreen=LysosomesCyan=Endosomes

Page 31: Murphy_PSLID_BOSC2009

SLMLToolsversions

•  Opensource–MatlabincludingC++•  Compiledversions(notrequiringMatlablicense)forMacOS,Windows,Linux

Page 32: Murphy_PSLID_BOSC2009

PSLID

•  Loadingpipelinedrivenbyscript–  Calculatesthumbnailimages,features,segmenta.on–  Createsdatabaserecordsandlinks–  Createspredefinedsets

•  Webapplica.on–  Createsetsbysearchingoncontextorcontent– AnalyzesetswithanySLICtool–  Fulldisplayorsummary

–  SOAP/XMLinterface

Page 33: Murphy_PSLID_BOSC2009

PSLID

•  Opensource•  Linuxonly:tomcat,postgres

Page 34: Murphy_PSLID_BOSC2009

AnnotatedDatasets

•  2Dand3Dimagesof9majorsubcellularpaVernsinHeLacells

•  3Dimagesof~300proteinsin3T3cells

•  2Dimagesof~3000proteinsin3T3cells

•  2Dand3DimagesforpaVernunmixing

•  Datasetsfromotherinves.gators

Page 35: Murphy_PSLID_BOSC2009

•  hVp://murphylab.web.cmu.edu/sooware•  hVp://murphylab.web.cmu.edu/data

•  PastmajorsupportfromNSF

•  CurrentsupportfromNIHNIGMSandNCRR– Na.onalCenterforNetworksandPathways:MolecularBiosensorsandImagingCenter(AlanWaggoner)