You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Graphlet AI is a data engineering, data science and artificial intelligence consultancy specializing in <i>knowledge graph construction</i>, also known as <b>property graph construction</b>. We transform and refine raw data on your data lake to build large networks ranging in the millions, billions or even trillions of nodes and edges that model entire business domains to solve complex problems with global footprints. We use big data tools and go beyond simple ETL by using machine learning and artificial intelligence to construct a graph model of your business domain that maps closely to solutions to your business problems. Using a modern graph database, your data science and machine learning teams can then efficiently mine this refined graph to find solutions to your most pressing data science problems.
22
+
<div>
23
+
Graphlet AI is a data engineering, data science and artificial intelligence consultancy specializing in <i>knowledge graph construction</i>, also known as <b>property graph construction</b>. We build data pipelines that take raw data and feed your graph database clean data.
24
+
</div>
25
+
<div style="margin-top: 2%;"></div>
26
+
<div>
27
+
We transform and refine raw data on your data lake to build large networks ranging in the millions, billions or even trillions of nodes and edges that model entire business domains to solve complex problems with global footprints.
28
+
</div>
29
+
<div style="margin-top: 2%;"></div>
30
+
<div>
31
+
We love big data and large networks. We use big data tools to scale data pipelines that go beyond traditional ETL and entity resolution using artificial intelligenecs - graph machine learning - to construct a high fidelity network model of your business domain that maps directly to solutions to your business problems. It lets you run the queries that answer problems vexing you and driving features your customers demand. Using a modern graph database, your data science and machine learning teams can then efficiently mine this refined graph to find solutions to your most pressing data science problems.
<img style="width: 80%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-1-Bronze-ETL.png" />
54
+
<img style="width: 70%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-1-Bronze-ETL.png" />
44
55
</center>
45
56
</div>
46
57
<div style="margin-top: 2%;"></div>
47
58
<div>
48
59
<center>
49
-
<img style="width: 80%;" alt="Transformed, Cleaned Data in Silver Tables" src="assets/slides/Entity-Resolution-Phase-1-Silver-ETL.png" />
60
+
<img style="width: 70%;" alt="Transformed, Cleaned Data in Silver Tables" src="assets/slides/Entity-Resolution-Phase-1-Silver-ETL.png" />
50
61
</center>
51
62
<div style="margin-top: 2%;"></div>
52
63
<div>
@@ -60,13 +71,13 @@ background: home/bg.png
60
71
<div style="margin-top: 2%;"></div>
61
72
<div>
62
73
<center>
63
-
<img style="width: 80%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-2---Blocking.jpg" />
74
+
<img style="width: 70%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-2---Blocking.jpg" />
64
75
</center>
65
76
</div>
66
77
<div style="margin-top: 2%;"></div>
67
78
<div>
68
79
<center>
69
-
<img style="width: 80%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-2---Manual-Matching.jpg" />
80
+
<img style="width: 70%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-2---Manual-Matching.jpg" />
70
81
</center>
71
82
</div>
72
83
<div style="margin-top: 2%;"></div>
@@ -80,25 +91,25 @@ background: home/bg.png
80
91
<div style="margin-top: 2%;"></div>
81
92
<div>
82
93
<center>
83
-
<img style="width: 80%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution---Ditto-Encoding.jpg" />
94
+
<img style="width: 70%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution---Ditto-Encoding.jpg" />
84
95
</center>
85
96
</div>
86
97
<div style="margin-top: 2%;"></div>
87
98
<div>
88
99
<center>
89
-
<img style="width: 80%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-3---LSH-Blocking.jpg" />
100
+
<img style="width: 70%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-3---LSH-Blocking.jpg" />
90
101
</center>
91
102
</div>
92
103
<div style="margin-top: 2%;"></div>
93
104
<div>
94
105
<center>
95
-
<img style="width: 80%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-3---Embedding-Distance.jpg" />
106
+
<img style="width: 70%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-3---Embedding-Distance.jpg" />
96
107
</center>
97
108
</div>
98
109
<div style="margin-top: 2%;"></div>
99
110
<div>
100
111
<center>
101
-
<img style="width: 80%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-3---Fine-Tuned-Classifier.jpg" />
112
+
<img style="width: 70%;" alt="Raw Data in Bronze Tables" src="assets/slides/Entity-Resolution-Phase-3---Fine-Tuned-Classifier.jpg" />
102
113
</center>
103
114
</div>
104
115
<div style="margin-top: 2%;"></div>
@@ -115,7 +126,7 @@ background: home/bg.png
115
126
<div style="margin-top: 2%;"></div>
116
127
<div>
117
128
<center>
118
-
<img style="width: 80%;" alt="Property graphs vs RDF Triples. Both are knowledge graphs." src="assets/slides/RDF-Triple-Stores-vs-Property-Graphs.jpg" />
129
+
<img style="width: 70%;" alt="Property graphs vs RDF Triples. Both are knowledge graphs." src="assets/slides/RDF-Triple-Stores-vs-Property-Graphs.jpg" />
119
130
</center>
120
131
</div>
121
132
<div style="margin-top: 2%;"></div>
@@ -149,6 +160,7 @@ background: home/bg.png
149
160
<div>
150
161
We can build knowledge graphs for any platform, but here are a few tools that are more up our alley to create business value using graphs and networks:
151
162
</div>
163
+
<div style="margin-top: 2%;"></div>
152
164
<li>
153
165
<ul>Python tools like <a href="https://pandas.pydata.org/">Pandas</a> and <a href="https://networkx.org/">NetworkX</a>, <a href="https://graph-tool.skewed.de/">graph-tool</a>, <a href="https://networkit.github.io/">NetworKit</a> or <a href="https://www.graphifi.com/easygraph">EasyGraph</a></ul>
154
166
<ul><a href="https://www.r-project.org/">R</a> tools like <a href="https://igraph.org/">iGraph</a>, <a href="https://tidygraph.data-imaginist.com/">tidygraph</a> and <a href="https://ggraph.data-imaginist.com/">ggraph</a></ul>
@@ -163,13 +175,15 @@ background: home/bg.png
163
175
</li>
164
176
</div>
165
177
<div>
178
+
<div style="margin-top: 2%;"></div>
166
179
<h2>Principal Consultant</h2>
167
180
<div>
168
181
My name is Russell Jurney. I work at the intersection of big data, large networks - property graphs or knowledge graphs, representation learning with Graph Neural Networks (GNNs), Natural Language Processing (NLP) and Understanding (NLU), model explainability using network visualization and vector search for information retrieval.
169
182
I am a startup product and engineering executive focused on building products driven by billion node+ networks. I have worked at cool places like Ning, LinkedIn and Hortonworks. I co-founded Deep Discovery to use networks, GNNs and visualizations to build an explainable risk score for KYC / AML.
170
183
</div>
184
+
<div style="margin-top: 2%;"></div>
171
185
<div>
172
-
I am a four-time O'Reilly author with 120 citations on Google Scholar for being the first to write about “agile data science” - agile development as applied to data science and machine learning. I am an applied researcher and product manager with 17 years of experience building and shipping data-driven products.
186
+
I am a four-time O'Reilly author with 122 citations on Google Scholar for being the first to write about “agile data science” - agile development as applied to data science and machine learning. I am an applied researcher and product manager with 17 years of experience building and shipping data-driven products.
173
187
</div>
174
188
<div>
175
189
I am currently fascinated by knowledge graph / property graph construction, graph representation learning, graph neural networks (GNNs), NLP/NLU techniques such as information extraction, named entity resolution (NER), coreference resolution, fact extraction, and entity linking. I do network science and machine learning - so I get stuff done :)
0 commit comments