stevecreedon

## gist:b145e07a1d9056fd8bb1afcd2c383a57
### Keybase proof

I hereby claim:

  * I am stevecreedon on github.
  * I am stevecreedon_cat (https://keybase.io/stevecreedon_cat) on keybase.
  * I have a public key ASAR9J7Ys7MmQO5ylyehskr2kafV6ff_UHKTIvsFieDGdQo

To claim this, I am signing this object:

## gist:8966a3cf9afb51cb44a7e9afdda2f230
### Keybase proof

I hereby claim:

  * I am stevecreedon on github.
  * I am stevecreedon_cat (https://keybase.io/stevecreedon_cat) on keybase.
  * I have a public key ASAR9J7Ys7MmQO5ylyehskr2kafV6ff_UHKTIvsFieDGdQo

To claim this, I am signing this object:

## keybase.md

      
Created January 25, 2018 19:04

### Keybase proof

I hereby claim:

  * I am stevecreedon on github.
  * I am stevecreedon (https://keybase.io/stevecreedon) on keybase.
  * I have a public key ASCFoLDFFgWiluQMVvgldwEJLh53F9n3ocCpKcPbLBCfywo

To claim this, I am signing this object:

  
## gist:de563ade880488b213b989543b9cc931
At resolver.co.uk we need some form of topic discovery for our large corpus of email conversations. I'm not trying to understand LDA (Latent Dirichlet Allocation) in its full scientific or mathematical depth, just how it works, so we can get the best out of it.

This is my best shot:

Say we have 1000 documents.

Let's make some huge assumptions:

1. Distributed across these documents we have 10 topics. We don't know what the topics are, and we don't know which documents contain which of these mystery topics.
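
To make that starting assumption concrete, here is a minimal sketch of the setup in Python. It is only a toy: the vocabulary, the `make_corpus` helper, the seed, and the choice of one to three topics per document are all assumptions made for illustration and are not part of LDA itself or of our real data. It builds 1000 fake documents from 10 hidden topics, so that the hidden topic labels play the role of the thing we don't know.

```python
import random

NUM_DOCS = 1000    # "Say we have 1000 documents"
NUM_TOPICS = 10    # "we have 10 topics"

# Hypothetical vocabulary: five made-up words per hidden topic.
TOPIC_WORDS = {
    t: [f"topic{t}_word{i}" for i in range(5)] for t in range(NUM_TOPICS)
}


def make_corpus(seed=0, words_per_doc=20):
    """Build a toy corpus where each document draws words from 1 to 3 hidden topics."""
    rng = random.Random(seed)
    documents = []
    hidden_topics = []
    for _ in range(NUM_DOCS):
        # Assumption for illustration: a document covers between 1 and 3 topics.
        doc_topics = rng.sample(range(NUM_TOPICS), rng.randint(1, 3))
        words = [
            rng.choice(TOPIC_WORDS[rng.choice(doc_topics)])
            for _ in range(words_per_doc)
        ]
        documents.append(words)
        hidden_topics.append(doc_topics)
    return documents, hidden_topics


documents, hidden_topics = make_corpus()

# A topic model would only ever see `documents`.
# `hidden_topics` is the mystery it has to recover.
print(len(documents), "documents,", NUM_TOPICS, "hidden topics")
```

In the real corpus we only have the equivalent of `documents`; nothing like `hidden_topics` exists anywhere, which is exactly why we need topic discovery.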