<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>Knowledge from Information by Matthias Broecheler</title>
	<atom:link href="http://www.knowledgefrominformation.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.knowledgefrominformation.com</link>
	<description>Research trends on extracting knowledge from large repositories of information.</description>
	<pubDate>Thu, 23 Aug 2012 20:25:21 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Efficient Multi-View Maintenance in the Social Semantic Web</title>
		<link>http://www.knowledgefrominformation.com/2012/04/20/efficient-multi-view-maintenance-in-the-social-semantic-web/</link>
		<comments>http://www.knowledgefrominformation.com/2012/04/20/efficient-multi-view-maintenance-in-the-social-semantic-web/#comments</comments>
		<pubDate>Fri, 20 Apr 2012 20:24:01 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Conference]]></category>

		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=160</guid>
		<description><![CDATA[Conference Poster, International World Wide Web Conference, WWW 2012, Lyon, France
Authors: Matthias Broecheler, Andrea Pugliese, VS Subrahmanian
Direct link to paper
Abstract
The Social Semantic Web (SSW) refers to the mix of RDF data in web content, and social network data associated with those who posted that content. Applications to monitor the SSW are becoming increasingly popular. For [...]]]></description>
			<content:encoded><![CDATA[<p>Conference Poster, International World Wide Web Conference, WWW 2012, Lyon, France<br />
Authors: Matthias Broecheler, Andrea Pugliese, VS Subrahmanian<br />
<a href="http://www2012.wwwconference.org/proceedings/companion/p467.pdf">Direct link to paper</a></p>
<p><strong>Abstract</strong><br />
The Social Semantic Web (SSW) refers to the mix of RDF data in web content, and social network data associated with those who posted that content. Applications to monitor the SSW are becoming increasingly popular. For instance, marketers want to look for semantic patterns relating to the content of tweets and Facebook posts relating to their products. Such applications allow multiple users to specify patterns of interest, and monitor them in real-time as new data gets added to the web or to a social network. In this paper, we develop the concept of SSW view servers in which all of these types of applications can be simultaneously monitored from such servers. The patterns of interest are views. We show that a given set of views can be compiled in multiple possible ways to take advantage of common substructures, and define the concept of an optimal merge. We develop a very fast MultiView algorithm that scalably and efficiently maintains multiple subgraph views. We show that our algorithm is correct, study its complexity, and experimentally demonstrate that our algorithm can scalably handle updates to hundreds of views on real-world SSW databases with up to 540M edges.</p>
<p><strong>Poster</strong><br />
<iframe src="http://www.slideshare.net/slideshow/embed_code/14054293" width="479" height="511" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" style="border:1px solid #CCC;border-width:1px 1px 0;margin-bottom:5px" allowfullscreen> </iframe>
<div style="margin-bottom:5px"> </div>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2012/04/20/efficient-multi-view-maintenance-in-the-social-semantic-web/feed/</wfw:commentRss>
		</item>
		<item>
		<title>PhD: Social Network Data Management</title>
		<link>http://www.knowledgefrominformation.com/2011/12/20/phd-social-network-data-management/</link>
		<comments>http://www.knowledgefrominformation.com/2011/12/20/phd-social-network-data-management/#comments</comments>
		<pubDate>Tue, 20 Dec 2011 20:32:36 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=146</guid>
		<description><![CDATA[Even long journeys eventually come to an end. After 4 years of intensive research I successfully defended my doctoral dissertation at the University of Maryland. The thesis ties together my work on indexing and querying huge social networks and machine learning on multi-relational data under the common theme of social network data management. A lot [...]]]></description>
			<content:encoded><![CDATA[<p>Even long journeys eventually come to an end. After 4 years of intensive research I successfully defended my doctoral dissertation at the University of Maryland. The thesis ties together my work on indexing and querying huge social networks and machine learning on multi-relational data under the common theme of social network data management. A lot of this work is introduced in individual blog posts on this site.</p>
<p><strong>Title</strong>: Social Network Data Management.</p>
<p><strong>Abstract</strong><br />
With the increasing usage of online social networks and the semantic web&#8217;s graph structured RDF framework, and the rising adoption of networks in various fields from biology to social science, there is a rapidly growing need for indexing, querying, and analyzing massive graph structured data. Facebook has amassed over 500 million users creating huge volumes of highly connected data. Governments have made RDF datasets containing billions of triples available to the public. In the life sciences, researches have started to connect disparate data sets of research results into one giant network of valuable information. Clearly, networks are becoming increasingly popular and growing rapidly in size, requiring scalable solutions for network data management.</p>
<p>This thesis focuses on the following aspects of network data management. We present a hierarchical index structure for external memory storage of network data that aims to maximize data locality. We propose efficient algorithms to answer subgraph matching queries against network databases and discuss effective pruning strategies to improve performance. We show how adaptive cost models can speed up subgraph matching query answering by assigning budgets to index retrieval operations and adjusting the query plan while executing.</p>
<p>We develop a cloud oriented social network database, COSI, which handles massive network datasets too large for a single computer by partitioning the data across multiple machines and achieving high performance query answering through asynchronous parallelization and cluster-aware heuristics.</p>
<p>Tracking multiple standing queries against a social network database is much faster with our novel multi-view maintenance algorithm, which exploits common substructures between queries.</p>
<p>To capture uncertainty inherent in social network querying, we define probabilistic subgraph matching queries over deterministic graph data and propose algorithms to answer them efficiently.</p>
<p>Finally, we introduce a general relational machine learning framework and rule-based language, Probabilistic Soft Logic, to learn from and probabilistically reason about social network data and describe applications to information integration and information fusion.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2011/12/20/phd-social-network-data-management/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Probabilistic Subgraph Matching on Huge Social Networks</title>
		<link>http://www.knowledgefrominformation.com/2011/07/28/probabilistic-subgraph-matching-on-huge-social-networks/</link>
		<comments>http://www.knowledgefrominformation.com/2011/07/28/probabilistic-subgraph-matching-on-huge-social-networks/#comments</comments>
		<pubDate>Thu, 28 Jul 2011 20:08:39 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Conference]]></category>

		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=144</guid>
		<description><![CDATA[Conference Paper, International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2011, Kaohsiung, Taiwan
Authors: Matthias Broecheler, Andrea Pugliese, VS Subrahmanian
Direct link to paper
Abstract
Users querying massive social networks or RDF databases are often not 100% certain about what they are looking for due to the complexity of the query or heterogeneity of the data. [...]]]></description>
			<content:encoded><![CDATA[<p>Conference Paper, International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2011, Kaohsiung, Taiwan<br />
Authors: Matthias Broecheler, Andrea Pugliese, VS Subrahmanian<br />
<a href="https://lccd-content.umiacs.umd.edu/main/papers/asonam2011_submission_134.pdf">Direct link to paper</a></p>
<p><strong>Abstract</strong><br />
Users querying massive social networks or RDF databases are often not 100% certain about what they are looking for due to the complexity of the query or heterogeneity of the data. In this paper, we propose “probabilistic subgraph” (PS) queries over a graph/network database, which afford users great flexibility in specifying “approximately” what they are looking for. We formally define the probability that a substitution satisfies a PS-query with respect to a graph database. We then present the PMATCH algorithm to answer such queries and prove its correctness. Our experimental evaluation demonstrates that PMATCH is efficient and scales to massive social networks with over a billion edges.</p>
<p><strong>Presentation</strong></p>
<div style="width:425px" id="__ss_11241329"><object id="__sse11241329" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=pmatchasonam2011v2-120124142008-phpapp02&#038;stripped_title=pmatch-asonam2011-v2&#038;userName=knowfrominfo" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><param name="wmode" value="transparent"/><embed name="__sse11241329" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=pmatchasonam2011v2-120124142008-phpapp02&#038;stripped_title=pmatch-asonam2011-v2&#038;userName=knowfrominfo" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" wmode="transparent" width="425" height="355"></embed></object></div>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2011/07/28/probabilistic-subgraph-matching-on-huge-social-networks/feed/</wfw:commentRss>
		</item>
		<item>
		<title>A Budget-Based Algorithm for Efficient Subgraph Matching on Huge Networks</title>
		<link>http://www.knowledgefrominformation.com/2011/04/16/budget-match-cost-effective-subgraph-matching-on-large-networks/</link>
		<comments>http://www.knowledgefrominformation.com/2011/04/16/budget-match-cost-effective-subgraph-matching-on-large-networks/#comments</comments>
		<pubDate>Sat, 16 Apr 2011 12:46:40 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Conference]]></category>

		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=133</guid>
		<description><![CDATA[Workshop Paper, ICDE Workshop on Graph Data Management, 2011, Hannover, Germany
Authors: Matthias Broecheler, Andrea Pugliese, VS Subrahmanian
Direct link to paper
Abstract
As social network and RDF data grow dramatically in size to billions of edges, the ability to scalably answer queries posed over graph datasets becomes increasingly important.
In this paper, we consider subgraph matching queries which are [...]]]></description>
			<content:encoded><![CDATA[<p>Workshop Paper, ICDE Workshop on Graph Data Management, 2011, Hannover, Germany<br />
Authors: Matthias Broecheler, Andrea Pugliese, VS Subrahmanian<br />
<a href="https://scm.umiacs.umd.edu/redmine/lccd/attachments/download/21/gdm2011_submission_budgetmatch.pdf">Direct link to paper</a></p>
<p><strong>Abstract</strong><br />
As social network and RDF data grow dramatically in size to billions of edges, the ability to scalably answer queries posed over graph datasets becomes increasingly important.<br />
In this paper, we consider subgraph matching queries which are often posed to social networks and RDF databases - for such queries, we want to find all matching instances in a graph database. Past work on subgraph matching queries uses static cost models which can be very inaccurate due to long-tailed degree distributions commonly found in real world networks. We propose the BudgetMatch query answering algorithm. BudgetMatch costs and recosts query parts adaptively as it executes and learns more about the search space. We show that using this strategy, BudgetMatch can quickly answer complex subgraph queries on very large graph data. Specifically, on a real world social media data set consisting of 1.12 billion edges, we can answer complex subgraph queries in under one second and significantly outperform existing subgraph matching algorithms.</p>
<p><strong>Presentation</strong></p>
<div style="width:425px" id="__ss_7648280"><object id="__sse7648280" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=budgetmatch-gdm-v1-110416073348-phpapp02&#038;stripped_title=budgetmatch-cost-effective-subgraph-matching-on-large-networks&#038;userName=knowfrominfo" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse7648280" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=budgetmatch-gdm-v1-110416073348-phpapp02&#038;stripped_title=budgetmatch-cost-effective-subgraph-matching-on-large-networks&#038;userName=knowfrominfo" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></div>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2011/04/16/budget-match-cost-effective-subgraph-matching-on-large-networks/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Probabilistic Soft Logic</title>
		<link>http://www.knowledgefrominformation.com/2010/12/03/probabilistic-soft-logic/</link>
		<comments>http://www.knowledgefrominformation.com/2010/12/03/probabilistic-soft-logic/#comments</comments>
		<pubDate>Sat, 04 Dec 2010 02:10:06 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Conference]]></category>

		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=119</guid>
		<description><![CDATA[Abstract
There is a growing interest in methods for exploiting causal or correlational dependencies in structured domains. Exploiting such dependencies often results in improved predictive performance on complex inference tasks in diverse domains such as information integration, natural language processing, and computer vision.  In this presentation, we introduce probabilistic soft logic (PSL), a general-purpose framework [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Abstract</strong><br />
There is a growing interest in methods for exploiting causal or correlational dependencies in structured domains. Exploiting such dependencies often results in improved predictive performance on complex inference tasks in diverse domains such as information integration, natural language processing, and computer vision.  In this presentation, we introduce probabilistic soft logic (PSL), a general-purpose framework for expressing, reasoning about and learning structural dependencies.<br />
PSL provides a declarative language tailored to relational domains that require reasoning about similarity and/or probability. Some of the novel aspects of PSL include a representation based on continuous valued random variables, efficient polynomial-time inference algorithms, native support for reasoning about sets, and the ability to estimate confidences values for predictions. This presentation provides a detailed account of PSL covering its mathematical foundation, logic programming semantics, inference and learning algorithms, scalability through parallelization, and different applications. We close with a demonstration of the PSL system implementation.</p>
<p><strong>Presentation</strong></p>
<div style="width:425px" id="__ss_6041686"><object id="__sse6041686" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=grandpsltalkdec62010-101205200125-phpapp01&#038;stripped_title=grand-psl-talkdec62010&#038;userName=knowfrominfo" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse6041686" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=grandpsltalkdec62010-101205200125-phpapp01&#038;stripped_title=grand-psl-talkdec62010&#038;userName=knowfrominfo" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></div>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2010/12/03/probabilistic-soft-logic/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Decision-Driven Models with Probabilistic Soft Logic</title>
		<link>http://www.knowledgefrominformation.com/2010/11/15/decision-driven-models-with-probabilistic-soft-logic/</link>
		<comments>http://www.knowledgefrominformation.com/2010/11/15/decision-driven-models-with-probabilistic-soft-logic/#comments</comments>
		<pubDate>Tue, 16 Nov 2010 01:54:19 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Conference]]></category>

		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=110</guid>
		<description><![CDATA[Workshop Paper, NIPS Workshop on Predictive Models in Personalized Medicine, 2010, Whistler, Canada
Authors: Stephen H. Bach, Matthias Broecheler, Stanley Kok, and Lise Getoor
Direct link to paper
Abstract
We introduce the concept of a decision-driven model, a probabilistic model that reasons directly over the uncertain information of interest to a decision maker. We motivate the use of these [...]]]></description>
			<content:encoded><![CDATA[<p>Workshop Paper, NIPS Workshop on Predictive Models in Personalized Medicine, 2010, Whistler, Canada<br />
Authors: Stephen H. Bach, Matthias Broecheler, Stanley Kok, and Lise Getoor<br />
<a href="http://waimea.cs.umd.edu:8080/basilic/web/Publications/2010/bach:pmpm10/bach-pmpm10.pdf">Direct link to paper</a></p>
<p><strong>Abstract</strong><br />
We introduce the concept of a decision-driven model, a probabilistic model that reasons directly over the uncertain information of interest to a decision maker. We motivate the use of these models from the perspective of personalized medicine. Decision-driven models have a number of benefits that are of particular value in this domain, such as being easily interpretable and naturally quantifying confidences in both evidence and predictions. We show how decision-driven models can easily be constructed using probabilistic soft logic, a recently introduced framework for statistical relational learning and inference which allows the specification of medical domain knowledge in concise first-order-logic rules with assigned confidence values. </p>
<p>The paper is presented as a poster. Follow the link for more information on <a href="http://www.slideshare.net/knowfrominfo/grand-psl-talkdec62010/88">decision-driven modeling in probabilistic soft logic.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2010/11/15/decision-driven-models-with-probabilistic-soft-logic/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Computing marginal distributions over continuous Markov networks for statistical relational learning</title>
		<link>http://www.knowledgefrominformation.com/2010/10/30/computing-marginal-distributions-over-continuous-markov-networks-for-statistical-relational-learning/</link>
		<comments>http://www.knowledgefrominformation.com/2010/10/30/computing-marginal-distributions-over-continuous-markov-networks-for-statistical-relational-learning/#comments</comments>
		<pubDate>Sun, 31 Oct 2010 01:55:03 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Conference]]></category>

		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=114</guid>
		<description><![CDATA[Conference Paper, Advances in Neural Information Processing Systems (NIPS), 2010, Vancouver, Canada
Authors: Matthias Broecheler and Lise Getoor
Direct link to paper
Data set used in the experiments
Abstract
Continuous Markov random fields are a general formalism to model joint probability distributions over events with continuous outcomes. We prove that marginal computation for constrained continuous MRFs is #P-hard in general [...]]]></description>
			<content:encoded><![CDATA[<p>Conference Paper, Advances in Neural Information Processing Systems (NIPS), 2010, Vancouver, Canada<br />
Authors: Matthias Broecheler and Lise Getoor<br />
<a href="http://waimea.cs.umd.edu:8080/basilic/web/Publications/2010/broecheler:nips10/broecheler-nips10.pdf">Direct link to paper</a><br />
<a href="https://scm.umiacs.umd.edu/redmine/lccd/attachments/download/8/nips2010data.zip">Data set used in the experiments</a></p>
<p><strong>Abstract</strong><br />
Continuous Markov random fields are a general formalism to model joint probability distributions over events with continuous outcomes. We prove that marginal computation for constrained continuous MRFs is #P-hard in general and present a polynomial-time approximation scheme under mild assumptions on the structure of the random field. Moreover, we introduce a sampling algorithm to compute marginal distributions and develop novel techniques to increase its efficency. Continuous MRFs are a general purpose probabilistic modeling tool and we demonstrate how they can be applied to statistical relational learning. On the problem of collective classification, we evaluate our algorithm and show that the standard deviation of marginals serves as a useful measure of confidence. </p>
<p><strong>Poster Presentation</strong></p>
<div style="width:425px" id="__ss_6041666"><object id="__sse6041666" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=nips2010postermarginals-101205195828-phpapp01&#038;stripped_title=computing-marginal-in-ccmrfs-nips-2010&#038;userName=knowfrominfo" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse6041666" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=nips2010postermarginals-101205195828-phpapp01&#038;stripped_title=computing-marginal-in-ccmrfs-nips-2010&#038;userName=knowfrominfo" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></div>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2010/10/30/computing-marginal-distributions-over-continuous-markov-networks-for-statistical-relational-learning/feed/</wfw:commentRss>
		</item>
		<item>
		<title>A Scalable Framework for Modeling Competitive Diffusion in Social Networks</title>
		<link>http://www.knowledgefrominformation.com/2010/08/01/a-scalable-framework-for-modeling-competitive-diffusion-in-social-networks/</link>
		<comments>http://www.knowledgefrominformation.com/2010/08/01/a-scalable-framework-for-modeling-competitive-diffusion-in-social-networks/#comments</comments>
		<pubDate>Sun, 01 Aug 2010 23:55:44 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=80</guid>
		<description><![CDATA[Conference Paper, Proceedings of the 2010 IEEE International Conference on Social Computing, Symposium Section
Authors: Matthias Broecheler, Paulo Shakarian, and V.S. Subrahmanian
Direct link to paper
Abstract
Multiple phenomena often diffuse through a social network, sometimes in competition with one another. Product adoption and political elections are two examples where network diffusion is inherently competitive in nature. For example, [...]]]></description>
			<content:encoded><![CDATA[<p>Conference Paper, Proceedings of the 2010 IEEE International Conference on Social Computing, Symposium Section<br />
Authors: Matthias Broecheler, Paulo Shakarian, and V.S. Subrahmanian<br />
<a href="http://lccd-content.umiacs.umd.edu/main/papers/diffusion-sc.pdf">Direct link to paper</a></p>
<p><strong>Abstract</strong><br />
Multiple phenomena often diffuse through a social network, sometimes in competition with one another. Product adoption and political elections are two examples where network diffusion is inherently competitive in nature. For example, individuals may choose to only select one product from a set of competing products (i.e. most people will need only one cell-phone provider) or can only vote for one person in a slate of political candidate (in most electoral systems). We introduce the weighted generalized annotated program (wGAP) framework for expressing competitive diffusion models. Applications are interested in the eventual results from multiple competing diffusion models (e.g. what is the likely number of sales of a given product, or how many people will support a particular candidate). We define the â€œmost probable interpretationâ€ (MPI) problem which technically formalizes this need. We develop algorithms to efficiently solve MPI and show experimentally that our algorithms work on graphs with millions of vertices.</p>
<p><strong>Presentation</strong></p>
<div style="width:425px" id="__ss_5047071"><object id="__sse5047071" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=socialcom2010v2-100824122603-phpapp01&#038;stripped_title=a-scalable-framework-for-modeling-competitive-diffusion-in-social-networks" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse5047071" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=socialcom2010v2-100824122603-phpapp01&#038;stripped_title=a-scalable-framework-for-modeling-competitive-diffusion-in-social-networks" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="padding:5px 0 12px">View more presentations from <a href="http://www.slideshare.net/knowfrominfo">Matthias Broecheler</a>.</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2010/08/01/a-scalable-framework-for-modeling-competitive-diffusion-in-social-networks/feed/</wfw:commentRss>
		</item>
		<item>
		<title>COSI: Cloud Oriented Subgraph Identification in Massive Social Networks</title>
		<link>http://www.knowledgefrominformation.com/2010/08/01/cosi-cloud-oriented-subgraph-identification-in-massive-social-networks/</link>
		<comments>http://www.knowledgefrominformation.com/2010/08/01/cosi-cloud-oriented-subgraph-identification-in-massive-social-networks/#comments</comments>
		<pubDate>Sun, 01 Aug 2010 06:00:34 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=76</guid>
		<description><![CDATA[Conference Paper, Proceedings of the 2010 International Conference on Advances in Social Networks Analysis and Mining, Odense, Denmark
Authors: Matthias BrÃ¶cheler, Andrea Pugliese and V.S. Subrahmanian
Direct link to paper
3rd Best Paper Award
Abstract
Subgraph matching is a key operation on graph data. Social network (SN) providers may want to find all subgraphs within their social network that â€œmatchâ€ [...]]]></description>
			<content:encoded><![CDATA[<p>Conference Paper, Proceedings of the 2010 International Conference on Advances in Social Networks Analysis and Mining, Odense, Denmark<br />
Authors: Matthias BrÃ¶cheler, Andrea Pugliese and V.S. Subrahmanian<br />
<a href="http://lccd-content.umiacs.umd.edu/main/papers/asonam2010.pdf">Direct link to paper</a></p>
<p><strong>3rd Best Paper Award</strong></p>
<p><strong>Abstract</strong><br />
Subgraph matching is a key operation on graph data. Social network (SN) providers may want to find all subgraphs within their social network that â€œmatchâ€ certain query graph patterns. Unfortunately, subgraph matching is NP-complete, making its application to massive SNs a major challenge. Past work has shown how to implement subgraph matching on a single processor when the graph has 10-25M edges. In this paper, we show how to use cloud computing in conjunction with such existing single processor methods to efficiently match complex subgraphs on graphs as large as 778M edges. A cloud consists of one â€œmasterâ€ compute node and k â€œslaveâ€ compute nodes. We first develop a probabilistic method to estimate probabilities that a vertex will be retrieved by a random query and that a pair of vertices will be successively retrieved by a random query. We use these probability estimates to define edge weights in an SN and to compute minimal edge cuts to partition the graph amongst k slave nodes. We develop algorithms for both master and slave nodes that try to minimize communication overhead. The resulting COSI system can answer complex queries over real-world SN data containing over 778M edges very efficiently.</p>
<p><strong>Presentation at ASONAM 2010</strong></p>
<div style="width:425px" id="__ss_4983704"><object id="__sse4983704" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=asonam2010v2-100816125021-phpapp02&#038;rel=0&#038;stripped_title=cosi-cloud-oriented-subgraph-i" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse4983704" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=asonam2010v2-100816125021-phpapp02&#038;rel=0&#038;stripped_title=cosi-cloud-oriented-subgraph-i" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="padding:5px 0 12px">View more presentations from <a href="http://www.slideshare.net/knowfrominfo">Matthias Broecheler</a>.</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2010/08/01/cosi-cloud-oriented-subgraph-identification-in-massive-social-networks/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Probabilistic Similarity Logic</title>
		<link>http://www.knowledgefrominformation.com/2010/07/08/probabilistic-similarity-logic/</link>
		<comments>http://www.knowledgefrominformation.com/2010/07/08/probabilistic-similarity-logic/#comments</comments>
		<pubDate>Thu, 08 Jul 2010 23:48:25 +0000</pubDate>
		<dc:creator>Matthias</dc:creator>
		
		<category><![CDATA[Publication]]></category>

		<guid isPermaLink="false">http://www.knowledgefrominformation.com/?p=74</guid>
		<description><![CDATA[Conference Paper, Proceedings of the 2010 conference on Uncertainty in Artificial Intelligence
Presented at: Conference on Uncertainty in Artificial Intelligence, held on Santa Catalina Island, CA, USA from July 8th - July 11th, 2010
Authors: Matthias Broecheler, Lilyana Mihalkova, Lise Getoor
Direct link to paper
Data set used in the experiments
Abstract
Many machine learning applications require the ability to learn [...]]]></description>
			<content:encoded><![CDATA[<p>Conference Paper, Proceedings of the 2010 conference on Uncertainty in Artificial Intelligence<br />
Presented at: Conference on Uncertainty in Artificial Intelligence, held on Santa Catalina Island, CA, USA from July 8th - July 11th, 2010<br />
Authors: Matthias Broecheler, Lilyana Mihalkova, Lise Getoor<br />
<a href="http://linqs.cs.umd.edu/basilic/web/Publications/2010/broecheler:uai10/broecheler-uai10.pdf">Direct link to paper</a><br />
<a href="https://scm.umiacs.umd.edu/redmine/lccd/attachments/download/9/uai2010experimentsData.zip">Data set used in the experiments</a></p>
<p><strong>Abstract</strong><br />
Many machine learning applications require the ability to learn from and reason about noisy multi-relational data. To address this, several effective representations have been developed that provide both a language for expressing the structural regularities of a domain, and principled sup- port for probabilistic inference. In addition to these two aspects, however, many applications also involve a third aspectâ€“the need to reason about similaritiesâ€“which has not been directly supported in existing frameworks. This paper introduces probabilistic similarity logic (PSL), a general-purpose framework for joint reason- ing about similarity in relational domains that incorporates probabilistic reasoning about similarities and relational structure in a principled way. PSL can integrate any existing domain- specific similarity measures and also supports reasoning about similarities between sets of entities. We provide efficient inference and learn- ing techniques for PSL and demonstrate its effectiveness both in common relational tasks and in settings that require reasoning about similarity.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.knowledgefrominformation.com/2010/07/08/probabilistic-similarity-logic/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
