-
Notifications
You must be signed in to change notification settings - Fork 7.9k
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Data preprocessing #95
Comments
Ok, what's your objective ?
…On Thu, Mar 2, 2023, 15:58 amira-yahlali ***@***.***> wrote:
I'm trying to clean my data and do some preprocessing but i don't have
much understanding of the Columns if the zero in them are normal or missing
values i'm using the dataset cic-collection on kaggle if any expert would
help i'd be much thankful
—
Reply to this email directly, view it on GitHub
<#95>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AMQQRCQQ3UD2DHFC2J2VBX3W2BY6VANCNFSM6AAAAAAVNF5SK4>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
I just need an understanding of what the columns represent and if the null value in each columns is a normal value or is it a missing value i'm trying to preprocess my data and like minimize it |
columns represent and if the
null value in each columns is a normal value
or is it a missing value need to see your data to identify theses points ?
i'm trying to preprocess
On Thu, Mar 2, 2023 at 6:07 PM amira-yahlali ***@***.***>
wrote:
… Ok, what's your objective ?
On Thu, Mar 2, 2023, 15:58 amira-yahlali *@*.***> wrote:
I'm trying to clean my data and do some preprocessing but i don't have
much understanding of the Columns if the zero in them are normal or missing
values i'm using the dataset cic-collection on kaggle if any expert would
help i'd be much thankful
—
Reply to this email directly, view it on GitHub
#95
<#95>,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/AMQQRCQQ3UD2DHFC2J2VBX3W2BY6VANCNFSM6AAAAAAVNF5SK4
.
You are receiving this because you are subscribed to this thread.Message
ID: *@*.***>
I just need an understanding of what the columns represent and if the null
value in each columns is a normal value or is it a missing value i'm trying
to preprocess my data and like minimize it
—
Reply to this email directly, view it on GitHub
<#95 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AMQQRCWVTJXIJEV54HXN4TTW2CH6ZANCNFSM6AAAAAAVNF5SK4>
.
You are receiving this because you commented.Message ID:
***@***.***
com>
|
My data is the cic-ids-collection on kaggle using class label as target dropping label and the rest is features i'd love to send you my notebook directly to make it easier for you |
Hi, |
see brother, if u want to remove the columns having all the null values/missing values you can use : data.drop(colums=[' ',' ' ] , inplace=true) in order to remove those columns if u want to check the columns with number of non null values you can use data.info() to have precise understanding for the data . if order to check the outliers in the data you can use seaborn library and import pairplot fucntion i.e seaborn.pairplot in oder to have graph depicting the outliers . Regards |
Hello
Thanks for reaching out the problem has been fixed thank you for
consideration.
Best regards
…On Mon, Aug 12, 2024, 06:21 Heerak kashyap ***@***.***> wrote:
I'm trying to clean my data and do some preprocessing but i don't have
much understanding of the Columns if the zero in them are normal or missing
values i'm using the dataset cic-collection on kaggle if any expert would
help i'd be much thankful
see brother, if u want to remove the columns having all the null
values/missing values you can use : data.drop(colums=[' ',' ' ] ,
inplace=true) in order to remove those columns
if u want to check the columns with number of non null values you can use
data.info() to have precise understanding for the data .
if order to check the outliers in the data you can use seaborn library and
import pairplot fucntion i.e seaborn.pairplot in oder to have graph
depicting the outliers .
Regards
—
Reply to this email directly, view it on GitHub
<#95 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/A3DMJZS6NMVN2OGJMBENM2DZRBA5HAVCNFSM6AAAAAAVNF5SK6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEOBTGEZTCNZYGA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***
com>
|
I'm trying to clean my data and do some preprocessing but i don't have much understanding of the Columns if the zero in them are normal or missing values i'm using the dataset cic-collection on kaggle if any expert would help i'd be much thankful
The text was updated successfully, but these errors were encountered: