How can I improve the accuracy of my confusion matrix to 100%? [on hold] The 2019 Stack Overflow Developer Survey Results Are In Unicorn Meta Zoo #1: Why another podcast? Announcing the arrival of Valued Associate #679: Cesar Manara 2019 Moderator Election Q&A - Questionnaire 2019 Community Moderator Election ResultsImprove k-means accuracyCan training label confidence be used to improve prediction accuracy?How to improve my test accuracy using CNN in TensorflowTrain Accuracy vs Test Accuracy vs Confusion matrixImprove test accuracy for TensorFlow CNNHow to best estimate the coefficients of a confusion matrix in case of strong class imbalance?How can I improve the accuracy of my neural network on a very unbalanced dataset?Why 100% accuracy on test data is not good?How to build confusion matrix , when predicted value and actual value is in sentence?How to reach continue training in xgboost

60's-70's movie: home appliances revolting against the owners

What force causes entropy to increase?

Didn't get enough time to take a Coding Test - what to do now?

What is the padding with red substance inside of steak packaging?

Fixing different display colors within string

Sub-subscripts in strings cause different spacings than subscripts

One-dimensional Japanese puzzle

Is this wall load bearing? Blueprints and photos attached

Can the Right Ascension and Argument of Perigee of a spacecraft's orbit keep varying by themselves with time?

Would an alien lifeform be able to achieve space travel if lacking in vision?

Circular reasoning in L'Hopital's rule

Is 'stolen' appropriate word?

Example of compact Riemannian manifold with only one geodesic.

Using dividends to reduce short term capital gains?

Why not take a picture of a closer black hole?

Windows 10: How to Lock (not sleep) laptop on lid close?

Is every episode of "Where are my Pants?" identical?

"is" operation returns false even though two objects have same id

Drawing vertical/oblique lines in Metrical tree (tikz-qtree, tipa)

Why don't hard Brexiteers insist on a hard border to prevent illegal immigration after Brexit?

ELI5: Why do they say that Israel would have been the fourth country to land a spacecraft on the Moon and why do they call it low cost?

What's the point in a preamp?

How to determine omitted units in a publication

Variable with quotation marks "$()"



How can I improve the accuracy of my confusion matrix to 100%? [on hold]



The 2019 Stack Overflow Developer Survey Results Are In
Unicorn Meta Zoo #1: Why another podcast?
Announcing the arrival of Valued Associate #679: Cesar Manara
2019 Moderator Election Q&A - Questionnaire
2019 Community Moderator Election ResultsImprove k-means accuracyCan training label confidence be used to improve prediction accuracy?How to improve my test accuracy using CNN in TensorflowTrain Accuracy vs Test Accuracy vs Confusion matrixImprove test accuracy for TensorFlow CNNHow to best estimate the coefficients of a confusion matrix in case of strong class imbalance?How can I improve the accuracy of my neural network on a very unbalanced dataset?Why 100% accuracy on test data is not good?How to build confusion matrix , when predicted value and actual value is in sentence?How to reach continue training in xgboost










1












$begingroup$


Is there a possibility of attaining the above? Can someone share with me how to go about doing it if it is?










share|improve this question









New contributor




Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







$endgroup$



put on hold as too broad by Mark.F, Dawny33 yesterday


Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer. Avoid asking multiple distinct questions at once. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.













  • 3




    $begingroup$
    Whether or not this is possible and realistic goal depends heavily on the problem domain and nature of your data. Could you please share some more details by using edit to add this information to the question?
    $endgroup$
    – Neil Slater
    yesterday















1












$begingroup$


Is there a possibility of attaining the above? Can someone share with me how to go about doing it if it is?










share|improve this question









New contributor




Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







$endgroup$



put on hold as too broad by Mark.F, Dawny33 yesterday


Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer. Avoid asking multiple distinct questions at once. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.













  • 3




    $begingroup$
    Whether or not this is possible and realistic goal depends heavily on the problem domain and nature of your data. Could you please share some more details by using edit to add this information to the question?
    $endgroup$
    – Neil Slater
    yesterday













1












1








1


1



$begingroup$


Is there a possibility of attaining the above? Can someone share with me how to go about doing it if it is?










share|improve this question









New contributor




Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







$endgroup$




Is there a possibility of attaining the above? Can someone share with me how to go about doing it if it is?







machine-learning python






share|improve this question









New contributor




Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|improve this question









New contributor




Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|improve this question




share|improve this question








edited yesterday









Community

1




1






New contributor




Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked yesterday









RenaeRenae

164




164




New contributor




Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






Renae is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.




put on hold as too broad by Mark.F, Dawny33 yesterday


Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer. Avoid asking multiple distinct questions at once. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.









put on hold as too broad by Mark.F, Dawny33 yesterday


Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer. Avoid asking multiple distinct questions at once. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.









  • 3




    $begingroup$
    Whether or not this is possible and realistic goal depends heavily on the problem domain and nature of your data. Could you please share some more details by using edit to add this information to the question?
    $endgroup$
    – Neil Slater
    yesterday












  • 3




    $begingroup$
    Whether or not this is possible and realistic goal depends heavily on the problem domain and nature of your data. Could you please share some more details by using edit to add this information to the question?
    $endgroup$
    – Neil Slater
    yesterday







3




3




$begingroup$
Whether or not this is possible and realistic goal depends heavily on the problem domain and nature of your data. Could you please share some more details by using edit to add this information to the question?
$endgroup$
– Neil Slater
yesterday




$begingroup$
Whether or not this is possible and realistic goal depends heavily on the problem domain and nature of your data. Could you please share some more details by using edit to add this information to the question?
$endgroup$
– Neil Slater
yesterday










2 Answers
2






active

oldest

votes


















3












$begingroup$


Bayes error




To answer your question, first I should explain Bayes error. Assuming we know the exact joint distribution of feature vectors ($mathbfx$) and each class ($C_k$) as
$P(mathbfx,C_k)$, we build a classifier which assigns label $k$ to each feature vector by this criteria $$mathop arg max limits_k P(left. C_k right|mathbfx)$$
It can be shown that this is the best possible classifier by calculating the expected classification error on the whole feature space. This expected classification error is called Bayes error and is the minimum achievable classification error for this feature-label space.




Training error




If you evaluate your model on the training data and calculate confusion matrix using the training samples you may achieve 100% accuracy because your model may overfit your training data. It means your training error is 0 even the Bayes error may not be.




Generalization error




If you evaluate your model on the test data and calculate confusion matrix using the test samples you can not achieve 100% accuracy because you are evaluating the generalization capability of your model and its error can not be less than Bayes error.






share|improve this answer









$endgroup$








  • 2




    $begingroup$
    This is the most realistic and general answer for the question as written. However, OP might be working on a problem where 100% generalisation accuracy is theoretically possible (i.e. Bayes error is zero). They might also have a data set where it is practical to train for this goal (enough coverage of function domain that a ML approximation can get arbitrarily close to 100% accuracy). These things are highly unlikely for many real world examples, but perhaps OP has some kind of special case.
    $endgroup$
    – Neil Slater
    yesterday










  • $begingroup$
    @NeilSlater I’m very proud you approved of my answer Neil. Thanks for your comments which led to further clarification.
    $endgroup$
    – pythinker
    yesterday


















0












$begingroup$

Achieving such accuracy is hard but not impossible, especially when you test your model in real life to see if the model can achieve the same accuracy or not, here are some tips that help to improve your model accuracy:



1- change the algorithm that you used to train your model, for example, if you use a traditional machine learning algorithm like SVM, try using one of the deep learning algorithms such as CNN.



2- Obtain more data, change the quality of your data, do augmentation for your data, do some pre-processing on your data, or try other pre-processing techniques if you did already.



for more see here or here or here






share|improve this answer









$endgroup$



















    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    3












    $begingroup$


    Bayes error




    To answer your question, first I should explain Bayes error. Assuming we know the exact joint distribution of feature vectors ($mathbfx$) and each class ($C_k$) as
    $P(mathbfx,C_k)$, we build a classifier which assigns label $k$ to each feature vector by this criteria $$mathop arg max limits_k P(left. C_k right|mathbfx)$$
    It can be shown that this is the best possible classifier by calculating the expected classification error on the whole feature space. This expected classification error is called Bayes error and is the minimum achievable classification error for this feature-label space.




    Training error




    If you evaluate your model on the training data and calculate confusion matrix using the training samples you may achieve 100% accuracy because your model may overfit your training data. It means your training error is 0 even the Bayes error may not be.




    Generalization error




    If you evaluate your model on the test data and calculate confusion matrix using the test samples you can not achieve 100% accuracy because you are evaluating the generalization capability of your model and its error can not be less than Bayes error.






    share|improve this answer









    $endgroup$








    • 2




      $begingroup$
      This is the most realistic and general answer for the question as written. However, OP might be working on a problem where 100% generalisation accuracy is theoretically possible (i.e. Bayes error is zero). They might also have a data set where it is practical to train for this goal (enough coverage of function domain that a ML approximation can get arbitrarily close to 100% accuracy). These things are highly unlikely for many real world examples, but perhaps OP has some kind of special case.
      $endgroup$
      – Neil Slater
      yesterday










    • $begingroup$
      @NeilSlater I’m very proud you approved of my answer Neil. Thanks for your comments which led to further clarification.
      $endgroup$
      – pythinker
      yesterday















    3












    $begingroup$


    Bayes error




    To answer your question, first I should explain Bayes error. Assuming we know the exact joint distribution of feature vectors ($mathbfx$) and each class ($C_k$) as
    $P(mathbfx,C_k)$, we build a classifier which assigns label $k$ to each feature vector by this criteria $$mathop arg max limits_k P(left. C_k right|mathbfx)$$
    It can be shown that this is the best possible classifier by calculating the expected classification error on the whole feature space. This expected classification error is called Bayes error and is the minimum achievable classification error for this feature-label space.




    Training error




    If you evaluate your model on the training data and calculate confusion matrix using the training samples you may achieve 100% accuracy because your model may overfit your training data. It means your training error is 0 even the Bayes error may not be.




    Generalization error




    If you evaluate your model on the test data and calculate confusion matrix using the test samples you can not achieve 100% accuracy because you are evaluating the generalization capability of your model and its error can not be less than Bayes error.






    share|improve this answer









    $endgroup$








    • 2




      $begingroup$
      This is the most realistic and general answer for the question as written. However, OP might be working on a problem where 100% generalisation accuracy is theoretically possible (i.e. Bayes error is zero). They might also have a data set where it is practical to train for this goal (enough coverage of function domain that a ML approximation can get arbitrarily close to 100% accuracy). These things are highly unlikely for many real world examples, but perhaps OP has some kind of special case.
      $endgroup$
      – Neil Slater
      yesterday










    • $begingroup$
      @NeilSlater I’m very proud you approved of my answer Neil. Thanks for your comments which led to further clarification.
      $endgroup$
      – pythinker
      yesterday













    3












    3








    3





    $begingroup$


    Bayes error




    To answer your question, first I should explain Bayes error. Assuming we know the exact joint distribution of feature vectors ($mathbfx$) and each class ($C_k$) as
    $P(mathbfx,C_k)$, we build a classifier which assigns label $k$ to each feature vector by this criteria $$mathop arg max limits_k P(left. C_k right|mathbfx)$$
    It can be shown that this is the best possible classifier by calculating the expected classification error on the whole feature space. This expected classification error is called Bayes error and is the minimum achievable classification error for this feature-label space.




    Training error




    If you evaluate your model on the training data and calculate confusion matrix using the training samples you may achieve 100% accuracy because your model may overfit your training data. It means your training error is 0 even the Bayes error may not be.




    Generalization error




    If you evaluate your model on the test data and calculate confusion matrix using the test samples you can not achieve 100% accuracy because you are evaluating the generalization capability of your model and its error can not be less than Bayes error.






    share|improve this answer









    $endgroup$




    Bayes error




    To answer your question, first I should explain Bayes error. Assuming we know the exact joint distribution of feature vectors ($mathbfx$) and each class ($C_k$) as
    $P(mathbfx,C_k)$, we build a classifier which assigns label $k$ to each feature vector by this criteria $$mathop arg max limits_k P(left. C_k right|mathbfx)$$
    It can be shown that this is the best possible classifier by calculating the expected classification error on the whole feature space. This expected classification error is called Bayes error and is the minimum achievable classification error for this feature-label space.




    Training error




    If you evaluate your model on the training data and calculate confusion matrix using the training samples you may achieve 100% accuracy because your model may overfit your training data. It means your training error is 0 even the Bayes error may not be.




    Generalization error




    If you evaluate your model on the test data and calculate confusion matrix using the test samples you can not achieve 100% accuracy because you are evaluating the generalization capability of your model and its error can not be less than Bayes error.







    share|improve this answer












    share|improve this answer



    share|improve this answer










    answered yesterday









    pythinkerpythinker

    8191213




    8191213







    • 2




      $begingroup$
      This is the most realistic and general answer for the question as written. However, OP might be working on a problem where 100% generalisation accuracy is theoretically possible (i.e. Bayes error is zero). They might also have a data set where it is practical to train for this goal (enough coverage of function domain that a ML approximation can get arbitrarily close to 100% accuracy). These things are highly unlikely for many real world examples, but perhaps OP has some kind of special case.
      $endgroup$
      – Neil Slater
      yesterday










    • $begingroup$
      @NeilSlater I’m very proud you approved of my answer Neil. Thanks for your comments which led to further clarification.
      $endgroup$
      – pythinker
      yesterday












    • 2




      $begingroup$
      This is the most realistic and general answer for the question as written. However, OP might be working on a problem where 100% generalisation accuracy is theoretically possible (i.e. Bayes error is zero). They might also have a data set where it is practical to train for this goal (enough coverage of function domain that a ML approximation can get arbitrarily close to 100% accuracy). These things are highly unlikely for many real world examples, but perhaps OP has some kind of special case.
      $endgroup$
      – Neil Slater
      yesterday










    • $begingroup$
      @NeilSlater I’m very proud you approved of my answer Neil. Thanks for your comments which led to further clarification.
      $endgroup$
      – pythinker
      yesterday







    2




    2




    $begingroup$
    This is the most realistic and general answer for the question as written. However, OP might be working on a problem where 100% generalisation accuracy is theoretically possible (i.e. Bayes error is zero). They might also have a data set where it is practical to train for this goal (enough coverage of function domain that a ML approximation can get arbitrarily close to 100% accuracy). These things are highly unlikely for many real world examples, but perhaps OP has some kind of special case.
    $endgroup$
    – Neil Slater
    yesterday




    $begingroup$
    This is the most realistic and general answer for the question as written. However, OP might be working on a problem where 100% generalisation accuracy is theoretically possible (i.e. Bayes error is zero). They might also have a data set where it is practical to train for this goal (enough coverage of function domain that a ML approximation can get arbitrarily close to 100% accuracy). These things are highly unlikely for many real world examples, but perhaps OP has some kind of special case.
    $endgroup$
    – Neil Slater
    yesterday












    $begingroup$
    @NeilSlater I’m very proud you approved of my answer Neil. Thanks for your comments which led to further clarification.
    $endgroup$
    – pythinker
    yesterday




    $begingroup$
    @NeilSlater I’m very proud you approved of my answer Neil. Thanks for your comments which led to further clarification.
    $endgroup$
    – pythinker
    yesterday











    0












    $begingroup$

    Achieving such accuracy is hard but not impossible, especially when you test your model in real life to see if the model can achieve the same accuracy or not, here are some tips that help to improve your model accuracy:



    1- change the algorithm that you used to train your model, for example, if you use a traditional machine learning algorithm like SVM, try using one of the deep learning algorithms such as CNN.



    2- Obtain more data, change the quality of your data, do augmentation for your data, do some pre-processing on your data, or try other pre-processing techniques if you did already.



    for more see here or here or here






    share|improve this answer









    $endgroup$

















      0












      $begingroup$

      Achieving such accuracy is hard but not impossible, especially when you test your model in real life to see if the model can achieve the same accuracy or not, here are some tips that help to improve your model accuracy:



      1- change the algorithm that you used to train your model, for example, if you use a traditional machine learning algorithm like SVM, try using one of the deep learning algorithms such as CNN.



      2- Obtain more data, change the quality of your data, do augmentation for your data, do some pre-processing on your data, or try other pre-processing techniques if you did already.



      for more see here or here or here






      share|improve this answer









      $endgroup$















        0












        0








        0





        $begingroup$

        Achieving such accuracy is hard but not impossible, especially when you test your model in real life to see if the model can achieve the same accuracy or not, here are some tips that help to improve your model accuracy:



        1- change the algorithm that you used to train your model, for example, if you use a traditional machine learning algorithm like SVM, try using one of the deep learning algorithms such as CNN.



        2- Obtain more data, change the quality of your data, do augmentation for your data, do some pre-processing on your data, or try other pre-processing techniques if you did already.



        for more see here or here or here






        share|improve this answer









        $endgroup$



        Achieving such accuracy is hard but not impossible, especially when you test your model in real life to see if the model can achieve the same accuracy or not, here are some tips that help to improve your model accuracy:



        1- change the algorithm that you used to train your model, for example, if you use a traditional machine learning algorithm like SVM, try using one of the deep learning algorithms such as CNN.



        2- Obtain more data, change the quality of your data, do augmentation for your data, do some pre-processing on your data, or try other pre-processing techniques if you did already.



        for more see here or here or here







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered yesterday









        SoKSoK

        31814




        31814













            -machine-learning, python

            Popular posts from this blog

            Creating 100m^2 grid automatically using QGIS?Creating grid constrained within polygon in QGIS?Createing polygon layer from point data using QGIS?Creating vector grid using QGIS?Creating grid polygons from coordinates using R or PythonCreating grid from spatio temporal point data?Creating fields in attributes table using other layers using QGISCreate .shp vector grid in QGISQGIS Creating 4km point grid within polygonsCreate a vector grid over a raster layerVector Grid Creates just one grid

            Why is this plane circling around the Lucknow airport every day?Why do aircraft on Flight Radar 24 jump around randomly sometimes?What airport has this walkway over a taxiway?How does Chicago O'Hare's tower sequence aircraft at peak capacity?Which airport is featured in this Delta commercial?After a crash, for how long is the airport closed?Can a passenger plane stand still in the air, or hover at a fixed location above a ground?What are those trucks towing around, and why?What is this airport outside of Cairo, Egypt?Which US airport has the lowest circling MDH?What is this airport video?

            Nikolai Prilezhaev Bibliography References External links Navigation menuEarly Russian Organic Chemists and Their Legacy092774english translationRussian Biography