koenderks
About
- Username
- koenderks
- Joined
- Visits
- 119
- Last Active
- Roles
- Member
Comments
-
Yes, this looks good! You now have a balanced test set and you can use the 'testIndicator' variable as the 'Test set indicator' (as shown below) to index the test set when training any of the algorithms in the machine learning module! https://forum.…
-
Hi may01dz, You cannot do this in the machine learning module directly but requires the manual inspection of the data and specification of a variable in the data set that indicates which observations belong to the test set (indicated by 1, you have …
-
Dear Faming, Unfortunately, I believe we currently do not yet support cross validation for random forest analyses, however this is on the agenda. We will take your request into account and see if we can implement it at the time! Best, Koen
-
Hi Faming, You can view the importance of the features in predicting the target variable by clicking the option “Feature importance” in the interface, which produces a table with the mean decrease in accuracy (when the feature is excluded) and the t…
-
They might have the same values for all variables? I can’t be sure without the data set. Best, koen
-
Hi, You can visualize the weighting scheme via a plot in the k-nn analysis. This will plot the weight as a function of the relative distance of the neighbors. See also https://epub.ub.uni-muenchen.de/1769/1/paper_399.pdf about more exact info abou…
-
I would say this partially depends on whether parameters in the algorithm are optimized under "Training parameters". In the k-nearest neighbors algorithm, the optimal number of neighbors is trained on the training set and after that optimi…
-
I’m not sure that I follow. If you fix the seed then the results should be the same every time you run the analysis :)
-
Each time you run the analysis it randomly selects a training(, validation) and test set to use, so it is expected that the results will be different across runs. You can disable this behavior of the analysis by fixing the seed in the training param…
-
Yes the best way to go is to uncheck the ‘scale predictors’ box. This way the raw data is used for everything. If you are missing a feature in the evaluation metrics table, could you please suggest it via out github page: https://github.com/jasp-s…
-
Hi profwriter, Sorry for the late reply. I believe what you are saying is correct: not influencing purchase (2) Is coded here as the value 1 in the logistic regression. That means the logistic regression is about not influencing purchase (2). You …
-
Hi Manon, Can you try the the following steps: Enter your raw data (no z-scores) in the "Variables" box in the cluster analysis. Go to "Advanced options" and make sure "scale variables" is off. Click to box "Set s…
-
Hi Manon, If the cluster memberships are in the variable "macro VS micro" then this variable should be dragged to the "Split" box. You should then insert all the variables you have used in your initial cluster analysis in the box…
-
Hi Manon, Here I am again. From the top of my head these 0’s are taken into account. However, it is hard to see without looking at what you are looking at. Is it possible for you to save your analyses and upload them as a .jasp file (the default jas…
-
Hi Manon, This can be achieved by exporting the clusters to your data set by clicking “Add predictions to data” and filling in a name for the new column. Then, you can go to the descriptives analysis and use this new column to split the data (i.e., …
-
Hi Andrearicci, Whenever you have performed a cluster analysis and have obtained clusters, you can add a variable with the cluster memberships to your data set by clicking “Add predictions to data” and filling in a column name for the new column. Th…
-
Dear Manon, Let me try to answer your questions! 1- Is there any easy-to-use guide about k-mean clustering with detailled informations (and maybe step-by-step procedure) ? I'm still struggling with the understanding of the software. Currently ther…
-
Hi Johan, It seems to me like you want to be able to state after your sample that the misstatement in the population is lower than 5,000,000 (10 percent of the population size/value). If you want to make this statement with a certain amount of confi…
-
Hi Mateus, I understand what you mean now! I actually don't think you can currently make the exact plot that you want in JASP. However, there are some ways in which we can cheat ourselves to something very similar. The first alternative I guess wou…
-
Hi Mateus, I'm having a little trouble understanding what kind of plot it is that you desire. From your second plot, it seems like you have already obtained a grouped scatter plot that represents the distribution of the clusters, as well as their de…
-
Hi vps2020, Good spot. That is because, by default, the predicted values are scaled/standardized to have a mean of zero and a standard deviation of one (see picture below). This is good practice in training a ml model, but this causes the predicted …
-
Hi vps2020, Great that you like the program and the machine learning module! To elaborate on your question regarding the relative importance of the variables in a random forest regression model, these can be requested in JASP via the "Variable …
-
Alright, good! Sorry for the hassle, in the next release of JASP Bain will get rid of it "Beta" status and these issues will be fixed.
-
If not, may I suggest that you download the try (nightly) version of JASP at http://static.jasp-stats.org/Nightlies/. I definately got it to work there! Can send you the output if you don't get it working.
-
Thanks. What seemed to work for me what changing the values of "Drug A" and Drug B" in the .csv file. Apparently bain cannot handle those spaces well. Mental note is made. Here is the .csv file I used (with changed values) https://fo…
-
Hi Mark, Can you send me the dataset, or tell me which data set you have used? Maybe I can try to replicate it and find a solution. Koen
-
Even though Bain is cool, the B) smiley should be a "B )"
-
Hi Mark, This is an error message that comes directly from the Bain package instead of JASP and occurs when the package does not recognize the grouping names from the hypotheses in the data. However, I think it could be due to the space in you grou…
-
Hi, There does not appear to be an image in your post. Can you provide either that, or the .jasp file that produced the error? Best, Koen