Python | Панды Series.describe ()
Серия Pandas - это одномерный массив ndarray с метками осей. Этикетки не обязательно должны быть уникальными, но должны быть хешируемого типа. Объект поддерживает индексирование как на основе целых чисел, так и на основе меток и предоставляет множество методов для выполнения операций, связанных с индексом.
Pandas Series.describe() function generate a descriptive statistics that summarize the central tendency, dispersion and shape of a dataset’s distribution for the given series object. All the calculations are performed by excluding NaN values.
Syntax: Series.describe(percentiles=None, include=None, exclude=None)
Parameter :
percentiles : The percentiles to include in the output.
include : A white list of data types to include in the result. Ignored for Series.
exclude : A black list of data types to omit from the result. Ignored for SeriesReturns : Summary statistics of the Series
Example #1: Use Series.describe() function to find the summary statistics of the given series object.
# importing pandas as pdimport pandas as pd # Creating the Seriessr = pd.Series([80, 25, 3, 25, 24, 6]) # Create the Indexindex_ = ["Coca Cola", "Sprite", "Coke", "Fanta", "Dew", "ThumbsUp"] # set the indexsr.index = index_ # Print the seriesprint(sr) |
Выход :
Now we will use Series.describe() function to find the summary statistics of the underlying data in the given series object.
# find summary statistics of the underlying # data in the given series object.result = sr.describe() # Print the resultprint(result) |
Output :
As we can see in the output, the Series.describe() function has successfully returned the summary statistics of the given series object.
Example #2 : Use Series.describe() function to find the summary statistics of the underlying data in the given series object. The given series object contains some missing values.
# importing pandas as pdimport pandas as pd # Creating the Seriessr = pd.Series([100, None, None, 18, 65, None, 32, 10, 5, 24, None]) # Create the Indexindex_ = pd.date_range("2010-10-09", periods = 11, freq ="M") # set the indexsr.index = index_ # Print the seriesprint(sr) |
Выход :

Now we will use Series.describe() function to find the summary statistics of the underlying data in the given series object.
# find summary statistics of the underlying # data in the given series object.result = sr.describe() # Print the resultprint(result) |
Выход :

As we can see in the output, the Series.describe() function has successfully returned the summary statistics of the given series object. NaN values has been ignored while calculating these statistical values.
Внимание компьютерщик! Укрепите свои основы с помощью базового курса программирования Python и изучите основы.
Для начала подготовьтесь к собеседованию. Расширьте свои концепции структур данных с помощью курса Python DS. А чтобы начать свое путешествие по машинному обучению, присоединяйтесь к курсу Машинное обучение - базовый уровень.