Ramkumar Chandrasekaran ramnov
@ramnov
ramnov / dataFrameWorkflow.R
Last active August 1, 2017 00:00
Microsoft R Server Operationalization Sample Code with data frame consumption
##########################################################
# Create & Test a Logistic Regression Model #
##########################################################
# load mrsdeploy package on R Server
library(mrsdeploy)
# Use logistic regression equation of vehicle transmission
# in the data set mtcars to estimate the probability of
# a vehicle being fitted with a manual transmission
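The preview above stops before the model is fitted. As a minimal sketch of the step the comments describe, the following uses only base R and the built-in `mtcars` data. The choice of predictors (`hp` and `wt`) and the helper name `manualTransmission` are assumptions made for illustration, not taken from the truncated gist.

```r
# Fit a logistic regression of transmission type (am: 0 = automatic, 1 = manual).
# Assumption: horsepower (hp) and weight (wt) are used as predictors.
carsModel <- glm(am ~ hp + wt, data = mtcars, family = binomial)

# Hypothetical helper: estimated probability of a manual transmission
# for a vehicle with the given horsepower and weight (in 1000 lbs).
manualTransmission <- function(hp, wt) {
  newdata <- data.frame(hp = hp, wt = wt)
  predict(carsModel, newdata, type = "response")
}

# Example call with illustrative input values
p <- manualTransmission(120, 2.8)
print(p)
```

A function of this shape, taking scalar inputs and returning a single numeric value, is the kind of code that mrsdeploy can publish as a web service, but the publishing step itself is not shown here.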
ramnov / swagger.R
Last active November 30, 2017 20:03
Generate a C# client library from swagger.json using R
# Generate Service Consumption Client Library in CSharp from swagger.json
install.packages("httr", dependencies = TRUE)
library(httr)
serviceName <- "ManualTransmissionService"
swaggerFileLocation <- "C:/Users/azureuser/Documents/swagger.json"
swagger <- readChar(swaggerFileLocation, file.info(swaggerFileLocation)$size)
r <- httr::POST("http://generator.swagger.io/api/gen/clients/csharp",
                add_headers("Content-Type" = "application/json"),
                # paste0() is used because paste() would insert spaces around the package name
                body = paste0("{\"options\": { \"packageName\" : \"", serviceName, "\" } ,",
                              "\"spec\":", swagger, "}"))
ramnov / rxoneclasssvm.R
Created May 23, 2017 05:46
Usage of rxOneClassSvm
# Estimate a One-Class SVM model (rxOneClassSvm is provided by the MicrosoftML package)
library(MicrosoftML)
trainRows <- c(1:30, 51:80, 101:130)
testRows <- !(1:150 %in% trainRows)
trainIris <- iris[trainRows,]
testIris <- iris[testRows,]
trainIris$id <- seq.int(nrow(trainIris))
svmModel <- rxOneClassSvm(
formula = ~Sepal.Length + Sepal.Width + Petal.Length + Petal.Width,
data = trainIris)
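The row selection used for this model can be checked without MicrosoftML. The sketch below, which assumes only base R and the built-in `iris` data (150 rows, 50 per species in order), rebuilds the split and counts how many rows of each species land in the training and test sets.

```r
# Rebuild the train/test split: the first 30 rows of each species go to training
trainRows <- c(1:30, 51:80, 101:130)
testRows <- !(1:150 %in% trainRows)   # logical mask of the remaining rows

trainIris <- iris[trainRows, ]
testIris <- iris[testRows, ]

# Count rows per species in each subset
trainCounts <- table(trainIris$Species)
testCounts <- table(testIris$Species)
print(trainCounts)
print(testCounts)
```

Each species contributes 30 training rows and 20 test rows, so the training set has 90 rows and the test set has 60.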
ramnov / Program.cs
Created May 19, 2017 21:04
C# code to install cert in Root and CurrentUser
using System;
using System.Collections.Generic;
using System.Linq;
using System.Security.Cryptography.X509Certificates;
namespace InstallCert
{
public class Program
{
public static void Main(string[] args)
ramnov / readLargeDataADLS.R
Created May 5, 2017 17:13
Read large data from ADLS into R
library(httr)
r <- httr::GET("https://<yourstorename>.azuredatalakestore.net/webhdfs/v1/mytempdir/iris.csv?op=OPEN&read=true",
add_headers(Authorization = "Bearer <AD AUTH TOKEN>"),
write_disk("iris.csv", overwrite=TRUE), progress())
irisData <- rxImport("iris.csv")
ramnov / rstudioserverStream.R
Created May 2, 2017 23:22
Test RStudio Server in Local Compute Context with HDFS data
# Work with HDFS data in local compute context
rxHadoopMakeDir("/share/SampleData")
rxHadoopCopyFromLocal(file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.csv"), "/share/SampleData")
airDS <- RxTextData(file = "/share/SampleData/AirlineDemoSmall.csv", missingValueString = "M",
                    fileSystem = RxHdfsFileSystem())
rxSetComputeContext("local")
adsSummary <- rxSummary(~ArrDelay+CRSDepTime+DayOfWeek, data = airDS)
print(adsSummary)
ramnov / rstudioserverSpark.R
Last active April 30, 2017 22:17
RStudio Server example code to run MRS in Spark compute context
# Spark Compute Context
sparkCC <- rxSparkConnect()
rxHadoopMakeDir("/share/SampleData")
rxHadoopCopyFromLocal(file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.csv"), "/share/SampleData")
airDS <- RxTextData(file = "/share/SampleData/AirlineDemoSmall.csv", missingValueString = "M",
                    fileSystem = RxHdfsFileSystem())
adsSummary <- rxSummary(~ArrDelay+CRSDepTime+DayOfWeek, data = airDS)
print(adsSummary)
rxSparkDisconnect(sparkCC)
ramnov / rstudioserverHadoop.R
Last active April 30, 2017 22:17
RStudio Server example code to run MRS in Hadoop compute context
# Hadoop Compute Context
rxSetComputeContext(RxHadoopMR())
rxHadoopMakeDir("/share/SampleData")
rxHadoopCopyFromLocal(file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.csv"), "/share/SampleData")
airDS <- RxTextData(file = "/share/SampleData/AirlineDemoSmall.csv", missingValueString = "M",
                    fileSystem = RxHdfsFileSystem())
adsSummary <- rxSummary(~ArrDelay+CRSDepTime+DayOfWeek, data = airDS)
print(adsSummary)
ramnov / rstudioserverLocal.R
Last active April 30, 2017 21:34
Test RStudio Server with Local Compute Context
# Local Compute Context
rxSetComputeContext("local")
airDS <- RxTextData(file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.csv"))
adsSummary <- rxSummary(~ArrDelay+CRSDepTime+DayOfWeek, data = airDS)
print(adsSummary)
ramnov / stratifiedXdfSplit.R
Last active July 12, 2017 19:23
Stratified Splitting on AirlineDemoSmall XDF data using rxExecBy()
# Set Seed
set.seed(12345)
# AirlineDemoSmall XDF File
airline <- RxXdfData(file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.xdf"))
# Define a split function which splits the dataset into 75% train and 25% test using rxSplit
".split" <- function(keys, data)
{
dataSplit <- rxSplit(data,