ResMBS: Difference between revisions

From ngfci
Jump to navigation Jump to search
No edit summary
No edit summary
 
(3 intermediate revisions by the same user not shown)
Line 1: Line 1:


  resMBS is a graph / dataset that has been extracted from the contents of financial prospecti for US  
  resMBS is a graph / dataset that has been extracted from the contents of financial prospecti for US  
  residential mortgage backed securities filed with the SEC. These securities were first created in 2002.  
  residential mortgage backed securities filed with the SEC. These securities started becoming very
They reached a peak in 2006 and then started to decline in 2007 and came to an abrupt end in 2008.  
popular in 2002. The issued securities reached a peak in 2006 and then started to decline in 2007  
and came to an abrupt end in 2008.  
  We extracted the "financial supply chain" comprising "financial institutions" (FI) and the role (Role)  
  We extracted the "financial supply chain" comprising "financial institutions" (FI) and the role (Role)  
  that they play on a financial contract (FC).
  that they play on a financial contract (FC).
Line 11: Line 13:
  Doug Burdick, IBM
  Doug Burdick, IBM
  Soham De and Louiqa Raschid and Mingchao Shao and Zheng Xu and Elena Zotkina, University of Maryland
  Soham De and Louiqa Raschid and Mingchao Shao and Zheng Xu and Elena Zotkina, University of Maryland
  [https://drive.google.com/open?id=0BzTeYQSh4QTsdzRxbGhKSDVpblE]
  [https://drive.google.com/open?id=0BzTeYQSh4QTsWmJXSTM0TEFLa1E]


  The networks described in the paper can be viewed here.
  The networks described in the paper can be viewed here.
Line 22: Line 24:
  Probabilistic Financial Community Models with Latent Dirichlet Allocation for Financial Supply Chains
  Probabilistic Financial Community Models with Latent Dirichlet Allocation for Financial Supply Chains
  Zheng Xu and Louiqa Raschid, University of Maryland
  Zheng Xu and Louiqa Raschid, University of Maryland
  [https://drive.google.com/open?id=0BzTeYQSh4QTsYUFIVlQ3QTVHQW8]
  [https://drive.google.com/open?id=0BzTeYQSh4QTsWWJ4b0RiMWFaSlk]


  If you want the gory details of the tools on the IBM System T platform that were developed ...
  If you want the gory details of the tools on the IBM System T platform that were developed ...
  Exploiting Lists of Names for Named Entity Identification of Financial Institutions from Unstructured Documents
  Exploiting Lists of Names for Named Entity Identification of Financial Institutions from  
Unstructured Documents
  Zheng Xu (University of Maryland) and Douglas Burdick (IBM) and Louiqa Raschid (University of Maryland)
  Zheng Xu (University of Maryland) and Douglas Burdick (IBM) and Louiqa Raschid (University of Maryland)
  [http://arxiv.org/abs/1602.04427]
  [http://arxiv.org/abs/1602.04427]


  The dataset is available for research.
  The dataset is available for research.

Latest revision as of 04:14, 23 May 2016

resMBS is a graph / dataset that has been extracted from the contents of financial prospecti for US 
residential mortgage backed securities filed with the SEC. These securities started becoming very
popular in 2002. The issued securities reached a peak in 2006 and then started to decline in 2007 
and came to an abrupt end in 2008. 

We extracted the "financial supply chain" comprising "financial institutions" (FI) and the role (Role) 
that they play on a financial contract (FC).
The following paper provides an overview of how the dataset was created and some preliminary clustering 
analysis on the graph.
resMBS: Constructing a Financial Supply Chain from Prospecti
Doug Burdick, IBM
Soham De and Louiqa Raschid and Mingchao Shao and Zheng Xu and Elena Zotkina, University of Maryland
[1]
The networks described in the paper can be viewed here.
FI clusters [2]
FI clusters based on FC-FC similarity [3]
FI-FC bipartite graph [4]
We used a topic modeling approach to develop a model FI-Comm where a topic is defined over a vocabulary
of FIs and a model Role-FI-Comm where a topic is defined over a vocabulary of Role-FI pairs.
Probabilistic Financial Community Models with Latent Dirichlet Allocation for Financial Supply Chains
Zheng Xu and Louiqa Raschid, University of Maryland
[5]
If you want the gory details of the tools on the IBM System T platform that were developed ...
Exploiting Lists of Names for Named Entity Identification of Financial Institutions from 
Unstructured Documents
Zheng Xu (University of Maryland) and Douglas Burdick (IBM) and Louiqa Raschid (University of Maryland)
[6]
The dataset is available for research.