您的当前位置：首页 W06-3815

W06-3815

来源：划驼旅游

ContextComparisonasaMinimumCostFlowProblem

VivianTsangandSuzanneStevensonDepartmentofComputerScience

UniversityofToronto

Canada

vyctsang,suzanne@cs.utoronto.ca

Abstract

Comparingwordcontextsisakeycompo-nentofmanyNLPtasks,butrarelyisitusedinconjunctionwithadditionalonto-logicalknowledge.Oneproblemisthattheamountofoverheadrequiredcanbehigh.Inthispaper,weprovideagraphi-calmethodwhicheasilycombinesanon-tologywithcontextualinformation.Wetakeadvantageoftheintrinsicgraphicalstructureofanontologyforrepresentingacontext.Inaddition,weturntheon-tologyintoametricspace,suchthatsub-graphswithinit,whichrepresentcontexts,canbecompared.Wedeveloptwovari-antsofourgraphicalmethodforcompar-ingcontexts.Ouranalysisindicatesthatourmethodperformsthecomparisonefﬁ-cientlyandoffersacompetitivealternativetonon-graphicalmethods.

1Introduction

Manynaturallanguageproblemscanbecastasaproblemofcomparing“contexts”(unitsoftext).Forexample,thelocalcontextofawordcanbeusedtoresolveitsambiguity(e.g.,Sch¨utze,1998),assum-ingthatwordsusedinsimilarcontextsarecloselyrelatedsemantically(MillerandCharles,1991).Ex-tendingthemeaningofcontext,thecontentofadocumentmayrevealwhichdocumentclass(es)itbelongsto(e.g.,Xuetal.,2003).Inanyappli-cation,onceasensibleviewofcontextisformu-lated,thenextstepistochoosearepresentationthatmakescomparisonspossible.Forexample,inword

sensedisambiguation,acontextofanambiguousinstancecanberepresentedasavectorofthefre-quenciesofwordssurroundingit.Untilrecently,thedominantapproachhasbeenanon-graphicalone—contextcomparisonisreducedtoataskofmeasuringdistributionaldistancebetweencontextvectors.Thedifferenceinthefrequencycharacteristicsofcon-textsisusedasanindicatorofthesemanticdistancebetweenthem.

Wepresentagraphicalalternativethatcombinesbothdistributionalandontologicalknowledge.Webeginwiththeuseofadifferentcontextrepresen-tationthatallowseasyincorporationofontologicalinformation.Treatinganontologyasanetwork,wecanrepresentacontextasasetofnodesinthenet-work(i.e.,conceptsintheontology),eachwithaweight(i.e.,frequency).TocontrastourworkwiththatofNavigliandVelardi(2005)andMihalcea(2006),thegoalisnotmerelytoprovideagraph-icalrepresentationforacontextinwhichtherele-vantconceptsareconnected.Rather,contextsaretreatedasweightedsubgraphswithinalargergraphinwhichtheyareconnectedviaasetofpaths.Byin-corporatingthesemanticdistancebetweenindivid-ualconcepts,thegraph(representingtheontology)becomesametricspaceinwhichwecanmeasurethedistancebetweensubgraphs(representingthecon-textstobecompared).

Morespeciﬁcally,measuringthedistancebe-tweentwocontextscanbeviewedassolvingamin-imumcostﬂow(MCF)problembycalculatingtheamountof“effort”requiredfortransportingtheﬂowfromonecontexttotheother.Ourmethodhastheadvantageofincludingsemanticinformation(bymakinguseofthegraphicalstructureofanontol-ogy)withoutlosingdistributionalinformation(by

WorkshoponTextGraphs,atHLT-NAACL2006,pages97–104,

c2006AssociationforComputationalLinguisticsNewYorkCity,June2006.󰀁

usingtheconceptfrequenciesderivedfromcorpusdata).

Thisnetworkﬂowformulation,thoughsupport-ingtheinclusionofanontologyincontextcompari-son,isnotﬂexibleenough.Theproblemisrootedinthechoiceofconcept-to-conceptdistance(i.e.,thedistancebetweentwoconcepts,tocontrastitfromtheoverallsemanticdistancebetweentwocontexts).Certainconcept-to-conceptdistancesmayresultinadifﬁcult-to-processnetworkwhichseverelycompro-misesefﬁciency.Toremedythis,weproposeanovelnetworktransformationmethodforconstructingapared-downnetworkwhichmimicsthestructureofthemoreprecisenetwork,butwithouttheexpensiveprocessingoranysigniﬁcantinformationlossasaresultofthetransformation.

Intheremainderofthispaper,weﬁrstpresenttheunderlyingnetworkﬂowframework,anddevelopamoreefﬁcientvariantofit.Wethenevaluatetherobustnessofourmethodsonacontextcomparisontask.Finally,weconcludewithananalysisandsomefuturedirections.

2TheNetworkFlowMethod

2.1

MinimumCostFlow

AsastandardexampleofanMCFproblem,considerthegraphicalrepresentationofaroutemapfordeliv-eringfreshproducefromgrocers(supplynodes)tohomes(demandnodes).Theremainingnodes(e.g.,intersections,gasstations)haveneitherasupplynorademand.Assumingtherearesufﬁcientsupplies,theoptimalsolutionistoﬁndthecheapestsetofroutesfromgrocerstohomessuchthatalldemandsaresatisﬁed.

Mathematically,letbeaconnectednetwork,whereisthesetofnodes,andisthe

1setofedges.Eachedgehasacost,

whichisthedistanceoftheedge.Eachnodeisassociatedwithavaluesuchthatindicatesitsavailablesupply(),itsdemand(),orneither().Thegoalistoﬁndasolutionforeachnodesuchthatalltheﬂowpassingthroughsatisﬁesitssupplyordemandrequirement().Theﬂowpassingthroughnodeiscapturedbysuchthatwecanobservethecom-98

demand.Thecostoftheroutesbetweennodesisdeterminedbyasemanticdistancemeasuredeﬁnedoveranytwonodesintheontology.Now,asinthegrocerydeliverydomain,thegoalistoﬁndtheMCFfromsupplytodemand.

Wecantreatanyontologyasthetransportnet-work.Arelation(suchashyponymy)betweentwoconceptsonandeachisedgerepresentedcanbedeﬁnedbyanedgeastheseman-,andthecostticdistancebetweenthetwoconcepts.Thisseman-ticdistancecanbeassimpleasthenumberofedgesseparatingtheconcepts,ormoresophisticated,suchasLin’s(1998)information-theoreticmeasure.(SeeBudanitskyandHirst(2006)forasurveyofsuchmeasures).

Numerousmethodsarepossibleforconvertingthewordfrequencyvectorofacontexttoaconceptfrequencyvector(i.e.,acontextproﬁle).Onesimplemethodistotransfereachelementinthewordvector(i.e.,thefrequencyofeachword)tothecorrespond-ingconceptsintheontology,resultinginavectorofconceptfrequencies.Inthispaper,wehavecho-senauniformdistributionofwordfrequencycountsamongconcepts,insteadofaweighteddistributiontowardstherelevantconceptsforaparticulartext.SincewewishtoevaluatethestrengthofourmethodalonewithoutanyadditionalNLPeffort,webypasstheissueofapproximatingthetruedistributionoftheconceptsviawordsensedisambiguationorclass-basedapproximationmethods,suchasthosebyLiandAbe(1998)andClarkandWeir(2002).

Tocalculatethedistancebetweentwoproﬁles,weneedtocastoneproﬁleasthesupply(thatour)distanceandtheotherasthedemand().Noteissymmetric,sothechoiceofthesupplyandthedemandisarbitrary.Next,wemustdeterminethevalueofateachconceptnode;thisisjustthedifferencebetweenthe(normalized)supplyfre-quencyanddemandfrequency:

(4)

Thisformulayieldsthenetsupply/demand,,atnode.Recallthatourgoalistotransportallthesup-plytomeetthedemand—theﬁnalstepistodeter-minethecheapestroutesbetweenandsuchthattheconstraintsin(2)and(3)aresatisﬁed.Thetotal

distanceoftheroutes,ortheMCF,

ineqn.(1),isthedistancebetweenthetwocontextproﬁles.

Finally,itisimportanttonotethattheMCFfor-mulationdoesnotsimplyﬁndtheshortestpaths

fromtheconceptnodesinthesupplytothoseinthedemand.Becauseaproﬁleisafrequency-weightedconceptvector,someconceptnodesareweightedmoreheavilythanothers,andtheroutesbetweensuchnodesacrossthetwoproﬁlesarealsoweightedmoreheavily.Indeed,ineqn.(1),thecostofeachroute,,isweightedby(howmuchsup-ply,orfrequencyweight,istransportedbetweennodesand).

3GraphicalIssues

Asalludedtointheintroduction,certainconcept-to-conceptdistancesposeaproblemtosolvingtheMCFproblemeasily.Thedetailsaredescribednext.3.1

Additivity

Intheory,ourmethodhastheﬂexibilitytoincorpo-ratedifferentconcept-to-conceptdistances.Theis-sueliesinthealgorithmsforsolvingMCFproblems.Existingalgorithmsaregreedy—theytakeastep-wise“localist”approachonthesetofedgesconnect-ingthesupplyandthedemand;i.e.,ateachnode,thecheapestoutgoingedgeisselected.Theassump-tionisthattheconcept-to-conceptdistancefunctionisadditive.Mathematically,foranypathfromnodetonode,,whereand,thedistancebetweennodesandisthesumofthedistanceoftheedgesalongthepath:

(5)

Theadditivityofaconcept-to-conceptdistanceen-tailsthatselectingthecheapestedgeateachstep

(i.e.,locally)yieldstheoverallcheapestsetofroutes(i.e.,globally).Notethatsomeofthemostsuccess-fulconcept-to-conceptdistancesproposedintheCLliteraturearenon-additive(e.g.,Lin,1998;Resnik,1995).Thisposesaprobleminsolvingournetworkﬂowproblem—theglobaldistancebetweenanycon-cepts,and,cannotbecorrectlydeterminedbythegreedymethod.

3.2

ConstructinganEquivalentBipartiteNetwork

Theissueofnon-additivedistancescanbeaddressedinthefollowingway.Wemaptherelevantportion

Figure2:Anillustrationofthetransformations(lefttoright)fromtheoriginalnetwork(a)tothebipartitenetwork(b),andﬁnally,tothenetworkproducedbyourtransformation(c),giventwoproﬁlesSandD.Nodeslabelledwitheither“S”or“D”belongtothecorrespondingproﬁle.Nodeslabelledwith“”or“”arejunctionnodes(seesection4.2).

ofthenetworkintoanewnetworksuchthattheconcept-to-conceptdistanceispreserved,butwith-outtheproblemintroducedbynon-additivity.Onepossiblesolutionistoconstructacompletebipar-titegraphbetweenthesupplynodesandthedemandnodes(thenodesinthetwocontextproﬁles).Weset

inthebipartitegraphtothecostofeachedge

betheconcept-to-conceptdistancebetweenandintheoriginalnetwork.Sincethereisexactlyoneedgebetweenanypairofnodes,thenon-additivityisremovedentirely.(SeeFigures2(a)and2(b).)Now,wecanapplyanetworkﬂowsolveronthenewgraph.

However,oneproblemarisesfromperformingtheabovemapping—thereisaprocessingbottleneckasaresultofthequadraticincreaseinthenumberofedgesinthenewnetwork.Unfortunately,thoughtractable,polynomialcomplexityisnotalwaysprac-tical.Forexample,withanaverageof900nodesperproﬁle,making120proﬁlecomparisonsinaddi-tiontonetworkre-structuringcantakeaslongas10days.2Ifwechoosetouseanon-additivedistance,themethoddescribedabovedoesnotscaleupwellforalargenumberofcomparisons.Next,wepresentamethodtoalleviatethecomplexityissue.

4NetworkTransformation

Onemethodofalleviatingthebottleneckistoreducetheprocessingloadfromgeneratingalargenumber

100

non-additivityissue,bygeneratinganedgewiththeexactconcept-to-conceptdistanceforeachpotentialnodecomparison,but,asnotedabove,istooinef-ﬁcient.Oursolutionhereistoconstructanetworkthatusestheideaofapared-downA-shapedpathtomostlyavoidnon-additivity,butwithouttheinefﬁ-ciencyofthecompletebipartitegraph.Thus,asex-plainedinmoredetailinthefollowingsubsections,wetradeofftheexactnessofthedistancecalculationagainsttheefﬁciencyofthenetworkconstruction.4.2

NetworkConstruction

onthetransformednetwork.Observethateachedge

,withcost,inthecompletebipartitenetwork,where,,isnowinsteadrepre-sentedbythreeedges:,,and,whereand.Thus,thetransformed

,becomes:distancebetweenand,

(6)

Inournetworkconstruction,weexploitthegeneral

notionofanA-shapedpathbetweenanytwonodes,butreplacethe“tip”oftheAwithtwonodes.Thenforeachnodeandinproﬁlesand,wegen-erateanedgefromstoanancestorof(theleft“branch”oftheA),anedgefromdtoanan-of(theright“branch”oftheA),andancestor

edgebetweenand(thetwonodesformingthe“elongatedtip”oftheA).Eachedgehastheexactconcept-to-conceptdistancefromtheoriginalnet-work,sothatthedistancebetweenanytwonodesandisthesumofthreeexactdistances.

Thesetofancestornodes,and,comprisethe“junction”pointsatwhichthesupplyfromcanbetransportedacrosstothenodesintosatisfytheirdemand.Thesetofjunctionnodes,,forapro-ﬁle,mustbeselectedsuchthatforeachnodein,containsatleastoneancestorof.(Seesection4.4fordetailsonthejunctionselectionpro-cess.)Theresultingnetworkisconstructedbydi-rectlyconnectingeachproﬁletoitscorrespondingjunction,thenconnectingthetwojunctionsinthemiddle(Figure2(c)).

Thedifferencebetweenthecompletebipartitenetworkandthetransformednetworkhereisthat,insteadofconnectingeachnodeintoeverynodein,weconnecteachnodeintoeverynodein.ComparethetransformednetworkinFig-ure2(c)withthecompletebipartitenetworkinFig-ure2(b).Thecompletebipartitecomponentinthetransformednetwork(themiddleportionbetweenthejunctionnodeslabelledand)isconsid-erablysmallerinsize.Thus,thenumberofedgesinthetransformednetworkissigniﬁcantlyfeweraswell.

Next,wecanproceedtodeﬁnethecostfunction

whereisthepreciseconcept-to-conceptdistancebetweenandintheoriginalnetwork.Oncewehavesetupthetransformednetwork,wecansolvetheMCFinthisnetwork,yieldingthedis-tancebetweenthetwo(supplyanddemand)proﬁles.

4.3DistanceDistortion

Becausethedistancebetweennodesandisnowcalculatedasthesumofthreedistances(eqn.(6)),somedistortionmayresultfornon-additiveconcept-to-conceptdistances.Toillustratethedistortionef-fect,considerJiangandConrath’s(1997)distance:

(7)

whereistheinformationcontentofanode,andisthelowestcommonsubsumerofnodesand.Thisdistancemeasuresthedif-ferenceininformationcontentbetweentheconceptsandtheirlowestcommonsubsumers.

Afterthetransformation,thedistanceisdistortedinthefollowingway.Ifandhavenocommonjunctionancestor,thenbecomes:

(8)

whereandarethejunctionancestorsofand,respectively.Otherwise,ifandshareacommonancestoratthejunction,then

becomes,

wherethetermineqn.(8)isre-placedby.Ineithercase,thetransformationreplacesthelowestcommonsubsumer

ineqn.(7)withsomeothercommonsubsumer

(or,mentionedabove).Un-less,thedistanceisdistorted

byusingalessprecisequantity,.

Notethattheinformationcontentofaconceptisgivenbyitsmaximumlikelihoodestimatebasedon

101

itsfrequencyinalargecorpus.Anincrementinthefrequencyofaconceptleadstoanincrementinthefrequencyofallitsancestors.Duetothefrequencypercolation,conceptswithasmalldepthtendtoac-cumulatehighercountsthanthosedeeperinthehi-erarchy(notethedifferenceindepth:

).Thus,weexpecttheinforma-tioncontentofaconcepttobehigherthanitsan-cestors,i.e.,aconceptismoresemanticallyspeciﬁcthanitsancestors,whichiscapturedbytheuseofthenegativefunctioninthedeﬁnitionofIC.Thetransformeddistanceisdistortedaccordingly().

rithmsisquadratic(theformer)orcubic(thelatter)

inthenumberofnodesinanetwork,whichisunac-ceptablyexpensiveforourtransformationmethod.Notethattoensureeveryproﬁlenodehasanances-tornodeinthejunction,theselectionprocesshasalinearlowerbound.Tokeepthecostlow,itisbesttokeepalinearcomplexityforthejunctionselectionprocess.However,ifthisisnotpossible,itshouldbesigniﬁcantlylessexpensivethanaquadraticcom-plexity.Wewillempiricallyexploretheprocessfur-therinsection5.3.

5ContextComparison

Asalludedtoearlier,ournetworkﬂowmethodpro-videsanalternativetoapurelydistributionalandnon-graphicalapproachtocontextcomparison.Inthispaper,wewilltestbothvariantsofourmethod(withorwithoutthetransformationinsection4)inanamedisambiguationtaskinwhichthecontextwordswithinasmallwindowsurroundingtheam-biguouswordsarecompared.Ourpreliminaryanal-ysisshowsthatourgeneralnetworkﬂowframeworkisrobustandefﬁcient.5.1

NameDisambiguation

4.4JunctionSelection

Selectionofjunctionnodesisakeycomponentofthenetworktransformation.Trivially,ajunctionconsistingofproﬁlenodesyieldsanetworkequiva-lenttothecompletebipartitenetwork.Thekeyistoselectajunctionthatisconsiderablysmallerinsizethanitscorrespondingproﬁle,hence,cuttingdownthenumberofedgesgenerated,whichresultsinsig-niﬁcantsavingsincomplexity.

Notethatthereisatradeoffbetweentheover-allcomputationalefﬁciencyandthesimilaritybe-tweenthetransformednetworkandthecompletebi-partitenetwork.Thecloserthejunctionsaretothecorrespondingproﬁles,thecloserthetransformednetworkresemblesthecompletebipartitenetwork.Thoughthedistancecalculationismoreaccurate,suchanetworkisalsomoreexpensivetoprocess.Ontheotherhand,therearefewernodesinajunc-tionasitapproachestherootlevel,butthereismoredistortioninthetransformedconcept-to-conceptdis-tance.Clearly,itisimportanttobalancethetwofac-tors.

Selectingjunctionnodesinvolvesﬁndingasmallersetofancestornodesrepresentingthepro-ﬁlenodesinahierarchy.Inotherwords,thejunc-tioncanbeviewedasanalternativerepresentationwhichisageneralizationoftheproﬁlenodes.Inadditiontotheproﬁlenodes,thejunctionnodesarealsoincludedinthetransformednetwork.Theymayprovideextrainformationaboutthecorrespondingcontext.

FindingageneralizationofaproﬁleisexploredintheworksofClarkandWeir(2002)andLiandAbe(1998).Unfortunately,thecomplexityofthesealgo-102

Thegoalfornamedisambiguationistoclassifyeachambiguousinstanceonthebasisofitssurroundingcontext.Oneapproachistouseanunsupervisedmethodsuchasclustering.Thisinvolvesmakingalargenumberofpairwisecomparisonsbetweenin-dividualcontexts.Giventhatthereisanoverheadtoincorporatingontologicalinformation,ournet-workﬂowmethoddoesnotcomputedistancesasef-ﬁcientlyascalculatingapurelyarithmeticdistancesuchascosineorEuclideandistance.Ouralterna-tiveapproachistouseminimaltrainingdata.Us-ingahandfulofcontexts,wecanbuilda“goldstan-dard”proﬁleforeachsenseofanambiguousnamebyusingthecontextwordsofasmallnumberofinstances.Wethencomparethecontextproﬁleofeachinstancetothegoldstandards.Eachinstanceisgiventhelabelofthegoldstandardproﬁletowhichitscontextproﬁleistheclosest.5.2

ExperimentalSetup

Inournamedisambiguationexperiment,weusethedatacollectedbyPedersenetal.(2005)fortheirnamediscriminationtask.Thisdataistakenfrom

NamePairs

Ronaldo/DavidBeckham

0.74

Microsoft/IBM

0.56

Jordan/Egyptian

0.510.53

200(Full)0.800.730.77

200(Trans)0.88

0.98

0.75

0.97

0.76

0.750.76

0.830.820.990.99

Table1:Namedisambiguationresults(accuracy/F-measure)ataglance.Thebaselineistherelativefrequencyofthemajorityname.“200”and“100”givetheaveragedresults(overﬁvedifferentruns)using200and100randomlyselectedtraininginstancesperambiguousname.Theweightedaverageiscalculatedbasedonthenumberoftestinstancespertask.“Full”and“Trans”refertotheresultsusingthefullnetwork(pre-transformation)orthepared-downnetwork(withtransformation),respectively.

theAgenceFrancePressEnglishServiceportionoftheGigaWordEnglishcorpusdistributedbytheLin-guisticDataConsortium.Itconsistsofthecontextsofsixpairsofnames,including:thenamesoftwosoccerplayers(RonaldoandDavidBeckham);anethnicgroupandadiplomat(TajikandRolfEkeus);twocompanies(MicrosoftandIBM);twopoliticians(ShimonPeresandSlobodanMilosevic);anationandanationality(JordanandEgyptian);andtwocountries(FranceandJapan).ThesenamepairsareselectedbyPedersenetal.(2005)toreﬂectarangeofconfusabilitybetweennames.

Eachpairofnamesservesasoneofsixnamedisambiguationtasks.Eachnameinstancecon-sistsofacontextwindowof50words(25wordstotheleftandtotherightofthetargetname),withthetargetnameobfuscated.Forexample,forthetaskofdistinguishing“DavidBeckham”and“Ronaldo”,thetargetnameineachinstancebe-comes“David

Notethatthecomplexityofthisselectionprocessislinear,sinceallproﬁlenodesmustbeexaminedtoensuretheyhaveanancestorinthejunction;anyproﬁlenodeofwhichnojunctionnodeisanancestorisaddedtothejunction.Thisprocesscanonlybeavoidedbyusingjunctionnodesofzerodepthexclu-sively.

103

,whosedepthissmall.Junctionnodeswithasmalldepthdistortthedistancemorethanthosewithalargerdepth.Surprisingly,ourexperimentindicatesthatusingsuchnodesproducesequallygoodorbet-terperformance.Thissuggeststhatselectingajunc-tionwithalargerdepth,atleastforthedatainthistask,isnotnecessary.

SpeedImprovementIncomparisontoourre-portedrunningtimeonthepre-transformationnet-work(120comparisonsrunningfor10days),onthesamemachine,making12,000comparisonscannowbeaccomplishedwithintwohours.Intermsofcomplexity,ifwehaveproﬁlenodesandjunc-tionnodes,thenumberofedgestobeprocessedis

.Giventhatourjunctionshavesignif-icantlyfewernodesthantheoriginalproﬁles,therunningtimeissigniﬁcantlylessthanquadraticinthenumberofproﬁlenodes.

tivetoapurelydistributionalandnon-graphicalap-proach.

Inouron-goingwork,wearefurtherexploringhowthechoiceofjunctioninﬂuencestheperfor-manceofdifferenttypesofconcept-to-conceptse-manticdistances.Forexample,wouldabottom-upjunctionselectionapproach(fromtheproﬁlenodesinsteadoffromtherootlevel)resultinbetterper-formance?Inaddition,weintendtoexaminethegraphicalpropertiesoftheindividualproﬁlesaswellastheroutesbetweentheconceptsacrossproﬁlesselectedbyournetworkﬂowmethods.Suchanaly-seswillhelpusgaininsightintothestrengths(andweaknesses)oftakingadvantageofagraphicalrep-resentationofcontextsaswellastreatinganontol-ogyasametricspaceforcontextcomparisons.

References

Budanitsky,A.andHirst,G.(2006).EvaluatingWordNet-basedmeasuresofsemanticdistance.ComputationalLinguistics.Toappear.

Clark,S.andWeir,D.(2002).Class-basedprobabilityestima-tionusingasemantichierarchy.ComputationalLinguistics,28(2):187–206.

Jiang,J.andConrath,D.(1997).Semanticsimilaritybasedoncorpusstatisticsandlexicaltaxonomy.InProceedingsontheInternationalConferenceonResearchinComputationalLinguistics,pages19–33.

Li,H.andAbe,N.(1998).Wordclusteringanddisambiguationbasedonco-occurrencedata.InProceedingsofCOLING-ACL1998,pages749–755.

Lin,D.(1998).Aninformation-theoreticdeﬁnitionofsimilar-ity.InProceedingsofInternationalConferenceonMachineLearning.

Mihalcea,R.(2006).Randomwalksontextstructures.InPro-ceedingsofCICLing2006,pages249–262.

Miller,G.A.andCharles,W.G.(1991).Contextualcorrelatesofsemanticsimilarity.LanguageandCognitiveProcesses,6(1):1–28.

Navigli,R.andVelardi,P.(2005).Structuralsemanticinter-connections:Aknowledge-basedapproachtowordsensedisambiguation.IEEETransactionsonPatternAnalysisandMachineIntelligence,27(7).

Pedersen,T.,Purandare,A.,andKulkarni,A.(2005).Namediscriminationbyclusteringsimilarcontext.InProceedingsoftheSixthInternationalConferenceonIntelligentTextPro-cessingandComputationalLinguistics.

Resnik,P.(1995).Usinginformationcontenttoevaluatese-manticsimilarityinataxonomy.InProceedingsofthe14thInternationalJointConferenceonArtiﬁcialIntelligence.Sch¨utze,H.(1998).Automaticwordsensediscrimination.ComputationalLinguistics,24(1):97–123.

Xu,W.,Liu,X.,andGong,Y.(2003).Documentclusteringbasedonnon-negativematrixfactorization.InProceedingsofthe26thACMSIGIRConference.

7Conclusions

Wehavegivenanoverviewofournetworkﬂowfor-malismwhichseamlesslycombinesdistributionalandontologicalinformation.Givenasuitableon-tology,acontextvectorofwordfrequenciescanbetransformedintoacontextproﬁle—afrequencydistributionovertheconceptsintheontology.Incontrasttotraditionalnon-graphicalapproachestomeasuringonlythedistributionaldistancebetweencontextvectors,weprovideagraphicalformalismwhichincorporatesboththesemanticdistanceofthecomponentnodesaswellasthedistributionaldiffer-encesbetweenthecontextproﬁles.Bytakingadvan-tageofthegraphicalstructureofanontology,ourmethodallowsasystematicandmeaningfulwayofabstractingoverwordsinacontext,andbyexten-sion,ameaningfulwayofcomparingcontexts.

Oneconcernwithourmethodinitspre-transformationformisitsinabilitytoincorporatesophisticatedconcept-to-conceptsemanticdistancesefﬁciently.Toremedythis,weproposeanoveltech-niquethatmimicsthestructureofthemorecompu-tationallyintensivenetwork.Ourpreliminaryeval-uationshowsthatthetransformationdoesnotham-perthemethod’sabilitytomakeﬁne-grainedseman-ticdistinctions,andthecomputationalcomplexityisdrasticallyreducedaswell.Generally,ournetworkﬂowmethodpresentsahighlycompetitivealterna-104

因篇幅问题不能全部显示，请点此查看更多更全内容

查看全文