Final Capstone Project for Galvanize Data Science Immersive
by Matt Devor
I’ve been a Seinfeld fan since the early ’90s, so when I discovered a database of episode scripts, I thought it would be a great opportunity to explore the corpus of dialogue and learn much more about NLP, recurrent neural networks, and chatbots in the process.
Note that the initial part of this project, containing all EDA, topic modeling, sentiment analysis, and initial text generation attempts can be found here.
As such, I will not be spending as much time in this README discussing those elements of the analysis.
- Use EDA to completely understand the corpus of dialogue and episode information.
- Create a character-based sequence-to-sequence (seq2seq) LSTM (Long Short-Term Memory) recurrent neural network using dialogue from the dataset to create a chatbot that speaks like Jerry Seinfeld.
- Deploy a Flask app that allows users to have a conversation with the Jerry chatbot.
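As a rough sketch of what the character-level setup looks like, the dialogue pairs are turned into one-hot character tensors for the encoder and decoder, with the decoder target shifted one step ahead of the decoder input. The toy pairs and variable names below are illustrative, not the exact code from this repo:

```python
import numpy as np

# Hypothetical toy (prompt -> Jerry's reply) pairs for illustration
pairs = [("hi jerry", "hello!"), ("how are you?", "good, good.")]

# Special start/end tokens frame each decoder target sequence
START, END = "\t", "\n"
chars = sorted({c for q, a in pairs for c in q + a} | {START, END})
char2idx = {c: i for i, c in enumerate(chars)}

max_enc = max(len(q) for q, _ in pairs)
max_dec = max(len(a) for _, a in pairs) + 2  # room for START and END

# One-hot tensors: encoder input, decoder input, decoder target
enc_in = np.zeros((len(pairs), max_enc, len(chars)), dtype="float32")
dec_in = np.zeros((len(pairs), max_dec, len(chars)), dtype="float32")
dec_out = np.zeros((len(pairs), max_dec, len(chars)), dtype="float32")

for i, (q, a) in enumerate(pairs):
    for t, c in enumerate(q):
        enc_in[i, t, char2idx[c]] = 1.0
    framed = START + a + END
    for t, c in enumerate(framed):
        dec_in[i, t, char2idx[c]] = 1.0
        if t > 0:  # target is the decoder input shifted one step left
            dec_out[i, t - 1, char2idx[c]] = 1.0
```

These three tensors are what a Keras encoder-decoder model trains on: the encoder reads `enc_in`, and the decoder learns to predict `dec_out` one character at a time given `dec_in`.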
- 56,000 lines of dialogue in the series.
- ~14,500 lines spoken by Jerry.
- ~1,600 different characters have dialogue in the series, but Jerry, George, Elaine, and Kramer speak the bulk of the lines.
Figure 1. Lines Spoken per Character – All Seasons
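The per-character counts behind Figure 1 come down to a simple tally once the scripts are parsed into (speaker, line) pairs; the data below is an illustrative stand-in, not the real corpus:

```python
from collections import Counter

# Illustrative stand-in for the parsed script data: (speaker, line) tuples
lines = [
    ("JERRY", "Hello, Newman."),
    ("GEORGE", "It's not a lie if you believe it."),
    ("JERRY", "What's the deal with airline food?"),
    ("KRAMER", "Giddyup!"),
]

counts = Counter(speaker for speaker, _ in lines)
top = counts.most_common()  # speakers sorted by number of lines, descending
```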
- The final seq2seq model has a training accuracy of 53% after 100 epochs and a validation accuracy of 22%.
- While these values may seem low, it is notoriously difficult to evaluate text generation models, as we don’t necessarily want the output to be deterministic.
- As with any neural network, more data would be needed to allow the model to generalize well.
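Since we don't want the bot's replies to be deterministic, a common trick is temperature sampling over the model's per-character probabilities instead of always taking the argmax. A minimal, framework-independent version (the function name is mine, not from this repo):

```python
import numpy as np

def sample_with_temperature(probs, temperature=1.0, rng=None):
    """Sample a character index from a probability vector.

    temperature < 1 sharpens the distribution (more conservative output);
    temperature > 1 flattens it (more surprising output).
    """
    rng = rng or np.random.default_rng()
    logits = np.log(np.asarray(probs, dtype="float64") + 1e-9) / temperature
    exp = np.exp(logits - logits.max())  # softmax, numerically stable
    p = exp / exp.sum()
    return int(rng.choice(len(p), p=p))
```

At each decoding step the model's predicted character distribution is passed through this function, so the same prompt can produce different (but still plausible) replies.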
Jerry: Aw, we can't do it, who are we kidding.
You: Talk?
Jerry: Yeah.
You: Why not?
Jerry: You know its not like that.
Jerry: Boy, you got a lot of stuff here.
You: on the computer?
Jerry: What?
You: You live in the computer!
Jerry: Yeah.
You: Do you get lonely?
Jerry: I don't know.
You: How are you?
Jerry: Good, good.
You: Can I be your friend?
Jerry: Hey, it's the Kman.
You: I'm tired.
Jerry: Alright, that's right.
You: Are you?
Jerry: I can't believe it.
You: Do you get tired?
Jerry: I don't know.
You: Cause you're a bot!
Jerry: Oh, Elaine.
You: I'm DONE with this BOT!
Jerry: Hey, look at this guy!
You: Is George really a marine biologist?
Jerry: No, really.
You: What does that mean?
Jerry: Well.
You: Is it true or not??
Jerry: Wow.
You: Too many question marks?
Jerry: All right.
Here is a screenshot from the Flask app I created, which allows users to seamlessly interact with the Jerry Bot. Note that the late-’90s website design was completely intentional, as I thought it would fit well with the time period of the show ;)
Ideally, I'd like to deploy this on a free-tier AWS instance as well, in order to let others interact with Jerry and maybe even offer suggestions for improvements.
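The Flask side of the app can be as small as a single JSON endpoint; here is a minimal sketch, where `generate_reply` is a placeholder for the actual seq2seq inference loop:

```python
from flask import Flask, request, jsonify

app = Flask(__name__)

def generate_reply(prompt):
    # Placeholder for the real inference code: encode the prompt,
    # then decode the reply character by character.
    return "Hello, Newman."

@app.route("/chat", methods=["POST"])
def chat():
    prompt = request.get_json(force=True).get("message", "")
    return jsonify({"jerry": generate_reply(prompt)})
```

Run locally with `flask run`; the front end just POSTs the user's message to `/chat` and renders the reply.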
A seq2seq LSTM model was able to learn the speech patterns of Jerry Seinfeld relatively well, but it is far from perfect. As the conversations above show, the model responds with correct punctuation and sentence structure for the most part.
A few things that could potentially help any seq2seq model improve are:
- More data – helps the model generalize, so it can respond more sensibly to questions it has not seen.
- Additional LSTM layers – allow the model to learn longer sequences.
Other applications for seq2seq models include:
- Machine translation
- Image Captioning
- Text summarization
- Word-based model using BERT word embedding vectors.
- Continue refining model architecture and incorporate attention mechanisms.
- Create George, Elaine, and Kramer chatbots and have them interact.
- Thanks very much to Frank, Danny, and Kayla at Galvanize for all of the guidance, instruction, and wisdom. Without them, none of this would have been even remotely possible, and I will always be grateful for their knowledge and expertise.
- A ten-minute introduction to sequence-to-sequence learning in Keras
- How to Develop an Encoder-Decoder Model for Sequence-to-Sequence Prediction in Keras
- NLP | Sequence to Sequence Networks
- How I Used Deep Learning to Train a Chatbot to Talk Like Me (Sorta)