Commit 6335cfa

transcripts
1 parent 93d96b9 commit 6335cfa

46 files changed: 6532 additions, 281 deletions

transcripts/090_data_wrangling_with_python.txt

Lines changed: 2 additions & 2 deletions
Original file line number | Diff line number | Diff line change
@@ -1836,11 +1836,11 @@
18361836

18371837
00:57:49 What one would you recommend to people that maybe they don't know about or haven't heard about?
18381838

1839-
00:57:54 Well, we didn't get to talk much about NLP, but I really, really love Gensum and Spacey.
1839+
00:57:54 Well, we didn't get to talk much about NLP, but I really, really love Gensum and spaCy.
18401840

18411841
00:58:00 Both of them are really changing and pushing the space of kind of where academia is with natural language processing and giving it and making it available for us mere mortals.
18421842

1843-
00:58:12 So I really recommend if you want to take a look at how to use neural networks with natural language processing, that you check out both Gensum and Spacey.
1843+
00:58:12 So I really recommend if you want to take a look at how to use neural networks with natural language processing, that you check out both Gensum and spaCy.
18441844

18451845
00:58:21 Okay.
18461846

transcripts/090_data_wrangling_with_python.vtt

Lines changed: 2 additions & 2 deletions
@@ -2788,13 +2788,13 @@ And the first is we already enumerated quite the list, but there's over 90,000 p
27882788
What one would you recommend to people that maybe they don't know about or haven't heard about?
27892789

27902790
00:57:54.200 --> 00:58:00.040
2791-
Well, we didn't get to talk much about NLP, but I really, really love Gensum and Spacey.
2791+
Well, we didn't get to talk much about NLP, but I really, really love Gensum and spaCy.
27922792

27932793
00:58:00.040 --> 00:58:11.760
27942794
Both of them are really changing and pushing the space of kind of where academia is with natural language processing and giving it and making it available for us mere mortals.
27952795

27962796
00:58:12.160 --> 00:58:21.000
2797-
So I really recommend if you want to take a look at how to use neural networks with natural language processing, that you check out both Gensum and Spacey.
2797+
So I really recommend if you want to take a look at how to use neural networks with natural language processing, that you check out both Gensum and spaCy.
27982798

27992799
00:58:21.000 --> 00:58:21.560
28002800
Okay.

transcripts/154-python-in-genomics.txt

Lines changed: 9 additions & 9 deletions
@@ -680,13 +680,13 @@
680680

681681
00:27:08 And of course, it's a constant treadmill sort of thing.
682682

683-
00:27:12 So one of the tools that you're using that didn't surprise me but I think is interesting and I want to hear more about is Spacey.
683+
00:27:12 So one of the tools that you're using that didn't surprise me but I think is interesting and I want to hear more about is spaCy.
684684

685-
00:27:17 So I don't even think I've mentioned Spacey on the podcast before.
685+
00:27:17 So I don't even think I've mentioned spaCy on the podcast before.
686686

687687
00:27:20 Tell us about that.
688688

689-
00:27:21 What's Spacey?
689+
00:27:21 What's spaCy?
690690

691691
00:27:22 Yeah.
692692

@@ -708,7 +708,7 @@
708708

709709
00:27:58 We can't spend billions of dollars.
710710

711-
00:27:59 But what we're doing is we're taking Spacey and using that to parse some of these free text files and basically make recommendations to people.
711+
00:27:59 But what we're doing is we're taking spaCy and using that to parse some of these free text files and basically make recommendations to people.
712712

713713
00:28:09 So doing things like what are called entity recognition.
714714

@@ -720,7 +720,7 @@
720720

721721
00:28:23 And this is a great tool for extracting out named entities like that.
722722

723-
00:28:27 We've trained Spacey to find named entities based on our ontologies, our data within our KMS.
723+
00:28:27 We've trained spaCy to find named entities based on our ontologies, our data within our KMS.
724724

725725
00:28:35 Right.
726726

@@ -732,7 +732,7 @@
732732

733733
00:28:39 Exactly right.
734734

735-
00:28:40 So there's a pattern matching framework that's within Spacey that's really very easy to use.
735+
00:28:40 So there's a pattern matching framework that's within spaCy that's really very easy to use.
736736

737737
00:28:44 And then the other thing we'd use it for is for classification.
738738

@@ -778,13 +778,13 @@
778778

779779
00:30:05 It's pretty amazing.
780780

781-
00:30:06 We like Spacey a lot.
781+
00:30:06 We like spaCy a lot.
782782

783783
00:30:07 I originally tried playing with NLTK a few years ago and actually kind of ran into some barriers.
784784

785785
00:30:11 It's an old that's an older project.
786786

787-
00:30:13 Spacey is really modern and that it's, you know, kind of does some of the best practices with Python.
787+
00:30:13 spaCy is really modern and that it's, you know, kind of does some of the best practices with Python.
788788

789789
00:30:18 I highly recommend it.
790790

@@ -800,7 +800,7 @@
800800

801801
00:30:29 It definitely, they have it lined up to, when you go to visit spacey.io, it really looks appealing and polished.
802802

803-
00:30:37 I was wondering why you didn't choose, what the difference or what made you choose Spacey over NLTK?
803+
00:30:37 I was wondering why you didn't choose, what the difference or what made you choose spaCy over NLTK?
804804

805805
00:30:42 It's actually pretty obvious straight away, isn't it?
806806

transcripts/154-python-in-genomics.vtt

Lines changed: 9 additions & 9 deletions
@@ -1027,16 +1027,16 @@ Maybe I should do that.
10271027
And of course, it's a constant treadmill sort of thing.
10281028

10291029
00:27:12.500 --> 00:27:17.060
1030-
So one of the tools that you're using that didn't surprise me but I think is interesting and I want to hear more about is Spacey.
1030+
So one of the tools that you're using that didn't surprise me but I think is interesting and I want to hear more about is spaCy.
10311031

10321032
00:27:17.060 --> 00:27:20.260
1033-
So I don't even think I've mentioned Spacey on the podcast before.
1033+
So I don't even think I've mentioned spaCy on the podcast before.
10341034

10351035
00:27:20.260 --> 00:27:21.300
10361036
Tell us about that.
10371037

10381038
00:27:21.300 --> 00:27:22.000
1039-
What's Spacey?
1039+
What's spaCy?
10401040

10411041
00:27:22.000 --> 00:27:22.280
10421042
Yeah.
@@ -1069,7 +1069,7 @@ We're a small company.
10691069
We can't spend billions of dollars.
10701070

10711071
00:27:59.920 --> 00:28:09.520
1072-
But what we're doing is we're taking Spacey and using that to parse some of these free text files and basically make recommendations to people.
1072+
But what we're doing is we're taking spaCy and using that to parse some of these free text files and basically make recommendations to people.
10731073

10741074
00:28:09.520 --> 00:28:13.620
10751075
So doing things like what are called entity recognition.
@@ -1087,7 +1087,7 @@ Those would all be proper nouns.
10871087
And this is a great tool for extracting out named entities like that.
10881088

10891089
00:28:27.880 --> 00:28:35.580
1090-
We've trained Spacey to find named entities based on our ontologies, our data within our KMS.
1090+
We've trained spaCy to find named entities based on our ontologies, our data within our KMS.
10911091

10921092
00:28:35.580 --> 00:28:35.900
10931093
Right.
@@ -1105,7 +1105,7 @@ Something like that.
11051105
Exactly right.
11061106

11071107
00:28:40.040 --> 00:28:44.300
1108-
So there's a pattern matching framework that's within Spacey that's really very easy to use.
1108+
So there's a pattern matching framework that's within spaCy that's really very easy to use.
11091109

11101110
00:28:44.300 --> 00:28:47.140
11111111
And then the other thing we'd use it for is for classification.
@@ -1174,7 +1174,7 @@ Yeah.
11741174
It's pretty amazing.
11751175

11761176
00:30:06.320 --> 00:30:07.620
1177-
We like Spacey a lot.
1177+
We like spaCy a lot.
11781178

11791179
00:30:07.620 --> 00:30:11.740
11801180
I originally tried playing with NLTK a few years ago and actually kind of ran into some barriers.
@@ -1183,7 +1183,7 @@ I originally tried playing with NLTK a few years ago and actually kind of ran in
11831183
It's an old that's an older project.
11841184

11851185
00:30:13.380 --> 00:30:18.200
1186-
Spacey is really modern and that it's, you know, kind of does some of the best practices with Python.
1186+
spaCy is really modern and that it's, you know, kind of does some of the best practices with Python.
11871187

11881188
00:30:18.200 --> 00:30:19.500
11891189
I highly recommend it.
@@ -1207,7 +1207,7 @@ Looks really cool.
12071207
It definitely, they have it lined up to, when you go to visit spacey.io, it really looks appealing and polished.
12081208

12091209
00:30:37.160 --> 00:30:42.440
1210-
I was wondering why you didn't choose, what the difference or what made you choose Spacey over NLTK?
1210+
I was wondering why you didn't choose, what the difference or what made you choose spaCy over NLTK?
12111211

12121212
00:30:42.560 --> 00:30:44.880
12131213
It's actually pretty obvious straight away, isn't it?

transcripts/181-thirty-python-projects-of-2018.txt

Lines changed: 3 additions & 3 deletions
@@ -416,13 +416,13 @@
416416

417417
00:11:55 Yeah.
418418

419-
00:11:55 So going back to machine learning, our next one is Spacey.
419+
00:11:55 So going back to machine learning, our next one is spaCy.
420420

421421
00:12:00 When you think, when I think, at least when I think of machine learning and like text understanding, natural language understanding, I always thought of NLTP.
422422

423423
00:12:09 That seemed like the way to go.
424424

425-
00:12:11 But Spacey is the new fancy way to do that apparently.
425+
00:12:11 But spaCy is the new fancy way to do that apparently.
426426

427427
00:12:15 And it's written in Python and Cython.
428428

@@ -438,7 +438,7 @@
438438

439439
00:12:31 If you say wanted to, maybe you were an algorithmic trader and you were studying the live flow of data on Twitter and trying to look for sentiment analysis around a particular stock and then having automatic trading happen on that.
440440

441-
00:12:44 You probably would put Spacey in there.
441+
00:12:44 You probably would put spaCy in there.
442442

443443
00:12:46 Yeah, sure.
444444

transcripts/181-thirty-python-projects-of-2018.vtt

Lines changed: 3 additions & 3 deletions
@@ -628,7 +628,7 @@ It's probably pretty good, pretty nicely put together.
628628
Yeah.
629629

630630
00:11:55.820 --> 00:12:00.780
631-
So going back to machine learning, our next one is Spacey.
631+
So going back to machine learning, our next one is spaCy.
632632

633633
00:12:00.780 --> 00:12:09.080
634634
When you think, when I think, at least when I think of machine learning and like text understanding, natural language understanding, I always thought of NLTP.
@@ -637,7 +637,7 @@ When you think, when I think, at least when I think of machine learning and like
637637
That seemed like the way to go.
638638

639639
00:12:11.640 --> 00:12:15.160
640-
But Spacey is the new fancy way to do that apparently.
640+
But spaCy is the new fancy way to do that apparently.
641641

642642
00:12:15.160 --> 00:12:17.920
643643
And it's written in Python and Cython.
@@ -661,7 +661,7 @@ Anytime you want to take text and understand what the words mean, this is it, ri
661661
If you say wanted to, maybe you were an algorithmic trader and you were studying the live flow of data on Twitter and trying to look for sentiment analysis around a particular stock and then having automatic trading happen on that.
662662

663663
00:12:44.960 --> 00:12:46.820
664-
You probably would put Spacey in there.
664+
You probably would put spaCy in there.
665665

666666
00:12:46.820 --> 00:12:47.080
667667
Yeah, sure.

transcripts/202-software-biz.txt

Lines changed: 17 additions & 17 deletions
@@ -8,7 +8,7 @@
88

99
00:00:12 You could take things up a notch and change the equation and exchange value for money.
1010

11-
00:00:17 That's what Ines Montani and her co-founder did when they started Explosion.ai with Spacey as a foundation.
11+
00:00:17 That's what Ines Montani and her co-founder did when they started Explosion.ai with spaCy as a foundation.
1212

1313
00:00:23 Listen to her story about building a sustainable software business on open source and Python.
1414

@@ -82,9 +82,9 @@
8282

8383
00:02:11 But I guess that ties more into your linguistics side of things, right?
8484

85-
00:02:14 Yeah, and Spacey, especially.
85+
00:02:14 Yeah, and spaCy, especially.
8686

87-
00:02:15 Like Spacey, when I started working on Spacey, I realized that, wow, I can really combine what I liked.
87+
00:02:15 Like spaCy, when I started working on spaCy, I realized that, wow, I can really combine what I liked.
8888

8989
00:02:20 For example, front-end development with a bit more programming, other types of stuff with language
9090

@@ -108,15 +108,15 @@
108108

109109
00:02:46 Yeah, so our company is called Explosion AI, and we mostly develop a lot of open source software.
110110

111-
00:02:50 One of those projects is Spacey, which is a popular library for NLP and Python.
111+
00:02:50 One of those projects is spaCy, which is a popular library for NLP and Python.
112112

113113
00:02:55 We also publish other products and other tools for software development developers.
114114

115115
00:03:01 So we're a developer tools company.
116116

117117
00:03:03 It really sounds like such an interesting project.
118118

119-
00:03:04 And I love how you have Spacey and you kind of built a proper business on top of it,
119+
00:03:04 And I love how you have spaCy and you kind of built a proper business on top of it,
120120

121121
00:03:11 not, you know, just throw like a PayPal donate button on there and hope that that does something right.
122122

@@ -222,7 +222,7 @@
222222

223223
00:06:05 So let's talk about your journey and your project, your company, Explosion AI.
224224

225-
00:06:11 So I guess maybe just set the background, like how does Explosion AI and Spacey,
225+
00:06:11 So I guess maybe just set the background, like how does Explosion AI and spaCy,
226226

227227
00:06:16 and we'll talk about some of the products you have, how does that whole history fit together and whatnot?
228228

@@ -234,7 +234,7 @@
234234

235235
00:06:30 We're finally getting to a point where this can actually be used in like commercial projects and help companies.
236236

237-
00:06:35 So he left academia and started writing Spacey.
237+
00:06:35 So he left academia and started writing spaCy.
238238

239239
00:06:38 So he basically, he was in Australia, he moved to Berlin.
240240

@@ -248,7 +248,7 @@
248248

249249
00:06:56 And that's also something I was very interested in.
250250

251-
00:06:58 So we started working on Spacey and this worked very well.
251+
00:06:58 So we started working on spaCy and this worked very well.
252252

253253
00:07:02 So yeah, we founded Explosion AI together.
254254

@@ -260,7 +260,7 @@
260260

261261
00:07:15 So we really, there were lots of companies who were interested in that.
262262

263-
00:07:18 So we did have the advantage that we had Spacey.
263+
00:07:18 So we did have the advantage that we had spaCy.
264264

265265
00:07:20 People liked our work, wanted to implement this better.
266266

@@ -378,7 +378,7 @@
378378

379379
00:10:10 for machine learning projects and run experiments because the idea is, okay,
380380

381-
00:10:15 if you're using a library like Spacey or any other machine learning solution,
381+
00:10:15 if you're using a library like spaCy or any other machine learning solution,
382382

383383
00:10:18 you ultimately, once it's really giving you a lot of value, you want to train
384384

@@ -428,13 +428,13 @@
428428

429429
00:11:26 If you can write it in Python, you can use it in Prodigy.
430430

431-
00:11:28 Yeah, it's really cool because most people doing something with Spacey or other machine learning
431+
00:11:28 Yeah, it's really cool because most people doing something with spaCy or other machine learning
432432

433433
00:11:34 are, chances are they're already somewhat competent with Python anyway, right?
434434

435435
00:11:38 And so you might as well just leverage that and that sounds really cool.
436436

437-
00:11:42 And also a big reason people are using Spacey and open source tools like that
437+
00:11:42 And also a big reason people are using spaCy and open source tools like that
438438

439439
00:11:45 is because they want to build their own systems and implement them themselves
440440

@@ -660,11 +660,11 @@
660660

661661
00:17:30 So if you make it, like we've always wanted to have great docs, wanted it to be easy
662662

663-
00:17:33 for people to use Spacey.
663+
00:17:33 for people to use spaCy.
664664

665-
00:17:34 So if it's too easy to use Spacey, people don't need our services and we make less money.
665+
00:17:34 So if it's too easy to use spaCy, people don't need our services and we make less money.
666666

667-
00:17:38 But if Spacey is shit, we're losing business that way because people are not
667+
00:17:38 But if spaCy is shit, we're losing business that way because people are not
668668

669669
00:17:43 coming to us anymore and the library loses adoption.
670670

@@ -680,7 +680,7 @@
680680

681681
00:17:54 and actually, in fact, there are lots of consultants who are somewhat specializing
682682

683-
00:17:58 on building stuff with Spacey and it works well for them and we think that's much better
683+
00:17:58 on building stuff with spaCy and it works well for them and we think that's much better
684684

685685
00:18:02 for us than if we did all of that.
686686

@@ -852,7 +852,7 @@
852852

853853
00:22:34 that were actually built by pretty small teams, right?
854854

855-
00:22:37 there's not that many people behind Spacey and that's super successful.
855+
00:22:37 there's not that many people behind spaCy and that's super successful.
856856

857857
00:22:39 There's not that many people behind Flask, right?
858858
