SparkR :: gapply How to use LinearRegression across groups in DataFrame?

Geschlossen Veröffentlicht vor 2 Jahren Bezahlt bei Lieferung
Geschlossen

Hi there

I have big data which I am using for applying linear model to each group. I have small example of the data for the principle I want to have parallelised.

# Determine six waiting times with the largest eruption time in minutes.

schema <- structType(structField("waiting", "double"), structField("max_eruption", "double"))

result <- gapply(

df,

"waiting",

function(key, x) {

y <- [login to view URL](key, max(x$eruptions))

},

schema)

head(collect(arrange(result, "max_eruption", decreasing = TRUE)))

Datensuche R Programmiersprache

Projekt-ID: #30580205

Über das Projekt

4 Vorschläge Remote Projekt Aktiv vor 2 Jahren

4 Freelancer bieten im Durchschnitt €10/Stunde für diesen Job

Annmarie1995

Hi I am a professional statistician with 5 years of experience. I have read the job description. I will help you complete the project. i have skills in Data Mining and R Programming Language. I can deliver quality an Mehr

€16 EUR / Stunde
(23 Bewertungen)
4.9
WycOj

EXPERT IN STATISTICS Hello there, I am best in statistics, R programming analysis of data, SPSS, Statistical/Data Analysis, Multivariate Statistical Analysis, Regression Analysis, STATA, MINITAB, R language, Factor Ana Mehr

€10 EUR / Stunde
(19 Bewertungen)
4.4
ibahimakerkouch

Hi, I have a big experience on R programming also I am a master's degree in data science. You can see my profile and my reviews to prove to you that I worked well on R projects. Your project is a challenge for me. Le Mehr

€4 EUR / Stunde
(20 Bewertungen)
4.3
StatisticandArt

Hi, I graduated Bachelor of Statistics. I have experience using R because that application have been learned when i was college. I am also a specialist in Basic Statistical Analysis (descriptive analysis, graph, chart Mehr

€8 EUR / Stunde
(10 Bewertungen)
3.2