#A Collection of NLP notes
##N-grams
###Calculating unigram probabilities:
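P( wi ) = count( wi ) / count( total number of words )
In English: the probability of word wi is the number of times wi occurs in the corpus divided by the total number of words in the corpus.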
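The formula above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function name `unigram_probabilities`, the toy sentence, and the whitespace tokenization are all assumptions made for the example.

```python
from collections import Counter


def unigram_probabilities(tokens):
    """Return P(w) = count(w) / total number of tokens for each distinct token."""
    total = len(tokens)
    if total == 0:
        # An empty corpus has no probabilities to report.
        return {}
    counts = Counter(tokens)
    return {word: count / total for word, count in counts.items()}


# Hypothetical toy corpus, tokenized by splitting on whitespace.
tokens = "the cat sat on the mat".split()
probs = unigram_probabilities(tokens)

# "the" occurs 2 times out of 6 tokens, so P("the") = 2/6.
print(probs["the"])
```

Because every token is counted exactly once, the probabilities over all distinct words sum to 1. A real corpus would normally need lowercasing and punctuation handling before counting.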
<?xml version="1.0" encoding="utf-16"?>
<ShowPlanXML xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema" Version="1.2" Build="12.0.4436.0" xmlns="http://schemas.microsoft.com/sqlserver/2004/07/showplan">
<BatchSequence>
<Batch>
<Statements>
<StmtSimple StatementCompId="1" StatementEstRows="60.0079" StatementId="1" StatementOptmLevel="FULL" CardinalityEstimationModelVersion="120" StatementSubTreeCost="1034.33" StatementText="select 
 od.[Year], 
 AvgValue = avg(ObservationValue)
from dbo.ObservationDates od
join dbo.v_Observation o
 on o.ObservationDateKey = od.DateKey
where 
 od.[Year] >= 2000 and od.[Year] < 2006
group by 
 od.[Year]
option (querytraceon 4199)" StatementType="SELECT" QueryHash="0xE01A45FFCD2D134E" QueryPlanHash="0x9EFEF831BB146D" RetrievedFromCache="false">
<StatementSetOptions ANSI_NULLS="true" ANSI_PADDIN
Get it
http://www.rstudio.com/ide/download/desktop
http://en.wikipedia.org/wiki/R_(programming_language)
In R, the widely preferred assignment operator is an arrow made from two characters "<-", although "=" can be used instead.
Set up
install.packages('ggplot2')
install.packages('RSocrata')
Learning
Special thanks to Dušan Majkic (dmajkic, https://github.com/dmajkic/redis/) for his project on GitHub, which gave us the opportunity to quickly learn some of the intricacies of the Redis code. His project also helped us build our prototype quickly.
First clone the Redis sources from https://github.com/antirez/redis.