Overview
Brought to you by YData
Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 847.2 KiB |
| Average record size in memory | 867.6 B |
Variable types
| Numeric | 2 |
|---|---|
| Text | 5 |
| Categorical | 5 |
Alerts
expediente_invima is highly overall correlated with unidad_base and 1 other fields | High correlation |
factoresprecio is highly overall correlated with numerofactor | High correlation |
numerofactor is highly overall correlated with factoresprecio | High correlation |
unidad_base is highly overall correlated with expediente_invima and 1 other fields | High correlation |
unidad_de_dispensacion is highly overall correlated with expediente_invima and 1 other fields | High correlation |
Reproduction
| Analysis started | 2024-11-12 06:00:58.377129 |
|---|---|
| Analysis finished | 2024-11-12 06:01:24.419454 |
| Duration | 26.04 seconds |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
expediente_invima
Real number (ℝ)
High correlation
| Distinct | 553 |
|---|---|
| Distinct (%) | 55.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16989992 |
| Minimum | 10815 |
|---|---|
| Maximum | 19932353 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 10815 |
|---|---|
| 5-th percentile | 19512 |
| Q1 | 19903368 |
| median | 19912760 |
| Q3 | 19927074 |
| 95-th percentile | 19931121 |
| Maximum | 19932353 |
| Range | 19921538 |
| Interquartile range (IQR) | 23706 |
Descriptive statistics
| Standard deviation | 6822522.8 |
|---|---|
| Coefficient of variation (CV) | 0.40156126 |
| Kurtosis | 1.6836519 |
| Mean | 16989992 |
| Median Absolute Deviation (MAD) | 11746.5 |
| Skewness | -1.9120285 |
| Sum | 1.6989992 × 1010 |
| Variance | 4.6546817 × 1013 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 19907869 | 8 | 0.8% |
| 19906457 | 5 | 0.5% |
| 19929516 | 4 | 0.4% |
| 19930887 | 4 | 0.4% |
| 19930439 | 4 | 0.4% |
| 19906441 | 4 | 0.4% |
| 19918906 | 4 | 0.4% |
| 19901079 | 4 | 0.4% |
| 19908091 | 4 | 0.4% |
| 19929593 | 4 | 0.4% |
| Other values (543) | 955 |
| Value | Count | Frequency (%) |
| 10815 | 1 | |
| 11415 | 1 | |
| 11416 | 2 | |
| 11697 | 2 | |
| 11699 | 2 | |
| 11700 | 2 | |
| 11701 | 2 | |
| 11849 | 1 | |
| 11878 | 2 | |
| 11879 | 2 |
| Value | Count | Frequency (%) |
| 19932353 | 2 | |
| 19932247 | 2 | |
| 19932174 | 2 | |
| 19932170 | 1 | 0.1% |
| 19932152 | 2 | |
| 19932135 | 4 | |
| 19932108 | 1 | 0.1% |
| 19932060 | 2 | |
| 19932059 | 1 | 0.1% |
| 19931883 | 1 | 0.1% |
principio_activo
Text
| Distinct | 326 |
|---|---|
| Distinct (%) | 32.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.7 KiB |
Length
| Max length | 80 |
|---|---|
| Median length | 51 |
| Mean length | 15.312 |
| Min length | 7 |
Unique
| Unique | 62 ? |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | Midazolam |
|---|---|
| 2nd row | Acido Valproico |
| 3rd row | Acido Valproico |
| 4th row | Fluoxetina |
| 5th row | Proximetacaina |
| Value | Count | Frequency (%) |
| y | 93 | 5.6% |
| 84 | 5.0% | |
| de | 60 | 3.6% |
| acetaminofen | 36 | 2.2% |
| acido | 25 | 1.5% |
| hidroclorotiazida | 21 | 1.3% |
| clotrimazol | 20 | 1.2% |
| ibuprofeno | 17 | 1.0% |
| levotiroxina | 16 | 1.0% |
| sodica | 16 | 1.0% |
| Other values (370) | 1279 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1727 | 11.3% |
| i | 1715 | 11.2% |
| o | 1650 | 10.8% |
| n | 1122 | 7.3% |
| e | 954 | 6.2% |
| l | 866 | 5.7% |
| r | 859 | 5.6% |
| t | 805 | 5.3% |
| 667 | 4.4% | |
| c | 530 | 3.5% |
| Other values (46) | 4417 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15312 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1727 | 11.3% |
| i | 1715 | 11.2% |
| o | 1650 | 10.8% |
| n | 1122 | 7.3% |
| e | 954 | 6.2% |
| l | 866 | 5.7% |
| r | 859 | 5.6% |
| t | 805 | 5.3% |
| 667 | 4.4% | |
| c | 530 | 3.5% |
| Other values (46) | 4417 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15312 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1727 | 11.3% |
| i | 1715 | 11.2% |
| o | 1650 | 10.8% |
| n | 1122 | 7.3% |
| e | 954 | 6.2% |
| l | 866 | 5.7% |
| r | 859 | 5.6% |
| t | 805 | 5.3% |
| 667 | 4.4% | |
| c | 530 | 3.5% |
| Other values (46) | 4417 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15312 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1727 | 11.3% |
| i | 1715 | 11.2% |
| o | 1650 | 10.8% |
| n | 1122 | 7.3% |
| e | 954 | 6.2% |
| l | 866 | 5.7% |
| r | 859 | 5.6% |
| t | 805 | 5.3% |
| 667 | 4.4% | |
| c | 530 | 3.5% |
| Other values (46) | 4417 |
concentracion
Text
| Distinct | 466 |
|---|---|
| Distinct (%) | 46.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.3 KiB |
Length
| Max length | 89 |
|---|---|
| Median length | 72 |
| Mean length | 23.658 |
| Min length | 11 |
Unique
| Unique | 119 ? |
|---|---|
| Unique (%) | 11.9% |
Sample
| 1st row | Midazolam 15 mg |
|---|---|
| 2nd row | Divalproato Sodico 500 mg |
| 3rd row | Divalproato Sodico 500 mg |
| 4th row | Fluoxetina 20 mg |
| 5th row | Proximetacaina 5 mg |
| Value | Count | Frequency (%) |
| mg | 830 | 20.0% |
| g | 226 | 5.4% |
| 205 | 4.9% | |
| 500 | 83 | 2.0% |
| 1 | 83 | 2.0% |
| de | 80 | 1.9% |
| 10 | 75 | 1.8% |
| mcg | 75 | 1.8% |
| 50 | 73 | 1.8% |
| 100 | 66 | 1.6% |
| Other values (490) | 2353 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3151 | 13.3% | |
| a | 1809 | 7.6% |
| i | 1797 | 7.6% |
| o | 1754 | 7.4% |
| m | 1380 | 5.8% |
| g | 1176 | 5.0% |
| n | 1163 | 4.9% |
| 0 | 1113 | 4.7% |
| e | 1029 | 4.3% |
| l | 907 | 3.8% |
| Other values (57) | 8379 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23658 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3151 | 13.3% | |
| a | 1809 | 7.6% |
| i | 1797 | 7.6% |
| o | 1754 | 7.4% |
| m | 1380 | 5.8% |
| g | 1176 | 5.0% |
| n | 1163 | 4.9% |
| 0 | 1113 | 4.7% |
| e | 1029 | 4.3% |
| l | 907 | 3.8% |
| Other values (57) | 8379 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 23658 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3151 | 13.3% | |
| a | 1809 | 7.6% |
| i | 1797 | 7.6% |
| o | 1754 | 7.4% |
| m | 1380 | 5.8% |
| g | 1176 | 5.0% |
| n | 1163 | 4.9% |
| 0 | 1113 | 4.7% |
| e | 1029 | 4.3% |
| l | 907 | 3.8% |
| Other values (57) | 8379 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 23658 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3151 | 13.3% | |
| a | 1809 | 7.6% |
| i | 1797 | 7.6% |
| o | 1754 | 7.4% |
| m | 1380 | 5.8% |
| g | 1176 | 5.0% |
| n | 1163 | 4.9% |
| 0 | 1113 | 4.7% |
| e | 1029 | 4.3% |
| l | 907 | 3.8% |
| Other values (57) | 8379 |
unidad_base
Categorical
High correlation
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.7 KiB |
| mg | |
|---|---|
| ml | |
| g | |
| mcg | 23 |
| dosis | 20 |
| Other values (2) | 16 |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 1.966 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | ml |
|---|---|
| 2nd row | mg |
| 3rd row | mg |
| 4th row | mg |
| 5th row | ml |
Common Values
| Value | Count | Frequency (%) |
| mg | 530 | |
| ml | 293 | |
| g | 118 | 11.8% |
| mcg | 23 | 2.3% |
| dosis | 20 | 2.0% |
| IU | 15 | 1.5% |
| MIU | 1 | 0.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| mg | 530 | |
| ml | 293 | |
| g | 118 | 11.8% |
| mcg | 23 | 2.3% |
| dosis | 20 | 2.0% |
| iu | 15 | 1.5% |
| miu | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 846 | |
| g | 671 | |
| l | 293 | 14.9% |
| s | 40 | 2.0% |
| c | 23 | 1.2% |
| d | 20 | 1.0% |
| o | 20 | 1.0% |
| i | 20 | 1.0% |
| I | 16 | 0.8% |
| U | 16 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1966 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| m | 846 | |
| g | 671 | |
| l | 293 | 14.9% |
| s | 40 | 2.0% |
| c | 23 | 1.2% |
| d | 20 | 1.0% |
| o | 20 | 1.0% |
| i | 20 | 1.0% |
| I | 16 | 0.8% |
| U | 16 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1966 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| m | 846 | |
| g | 671 | |
| l | 293 | 14.9% |
| s | 40 | 2.0% |
| c | 23 | 1.2% |
| d | 20 | 1.0% |
| o | 20 | 1.0% |
| i | 20 | 1.0% |
| I | 16 | 0.8% |
| U | 16 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1966 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| m | 846 | |
| g | 671 | |
| l | 293 | 14.9% |
| s | 40 | 2.0% |
| c | 23 | 1.2% |
| d | 20 | 1.0% |
| o | 20 | 1.0% |
| i | 20 | 1.0% |
| I | 16 | 0.8% |
| U | 16 | 0.8% |
unidad_de_dispensacion
Categorical
High correlation
| Distinct | 18 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 62.2 KiB |
| Tableta | |
|---|---|
| Frasco | |
| Capsula | |
| Ampolla | |
| Vial | |
| Other values (13) |
Length
| Max length | 21 |
|---|---|
| Median length | 7 |
| Mean length | 6.613 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Ampolla |
|---|---|
| 2nd row | Tableta |
| 3rd row | Tableta |
| 4th row | Capsula |
| 5th row | Frasco |
Common Values
| Value | Count | Frequency (%) |
| Tableta | 406 | |
| Frasco | 162 | 16.2% |
| Capsula | 109 | 10.9% |
| Ampolla | 82 | 8.2% |
| Vial | 75 | 7.5% |
| Tubo | 69 | 6.9% |
| Inhalador | 24 | 2.4% |
| Bolsa | 17 | 1.7% |
| Jeringa Prellenada | 14 | 1.4% |
| Sobre | 9 | 0.9% |
| Other values (8) | 33 | 3.3% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| tableta | 411 | |
| frasco | 162 | 15.9% |
| capsula | 109 | 10.7% |
| ampolla | 82 | 8.0% |
| vial | 75 | 7.3% |
| tubo | 69 | 6.8% |
| inhalador | 24 | 2.4% |
| bolsa | 17 | 1.7% |
| jeringa | 14 | 1.4% |
| prellenada | 14 | 1.4% |
| Other values (10) | 44 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1494 | |
| l | 865 | |
| b | 494 | 7.5% |
| e | 484 | 7.3% |
| T | 480 | 7.3% |
| t | 430 | 6.5% |
| o | 389 | 5.9% |
| s | 297 | 4.5% |
| r | 236 | 3.6% |
| p | 198 | 3.0% |
| Other values (22) | 1246 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6613 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1494 | |
| l | 865 | |
| b | 494 | 7.5% |
| e | 484 | 7.3% |
| T | 480 | 7.3% |
| t | 430 | 6.5% |
| o | 389 | 5.9% |
| s | 297 | 4.5% |
| r | 236 | 3.6% |
| p | 198 | 3.0% |
| Other values (22) | 1246 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6613 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1494 | |
| l | 865 | |
| b | 494 | 7.5% |
| e | 484 | 7.3% |
| T | 480 | 7.3% |
| t | 430 | 6.5% |
| o | 389 | 5.9% |
| s | 297 | 4.5% |
| r | 236 | 3.6% |
| p | 198 | 3.0% |
| Other values (22) | 1246 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6613 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1494 | |
| l | 865 | |
| b | 494 | 7.5% |
| e | 484 | 7.3% |
| T | 480 | 7.3% |
| t | 430 | 6.5% |
| o | 389 | 5.9% |
| s | 297 | 4.5% |
| r | 236 | 3.6% |
| p | 198 | 3.0% |
| Other values (22) | 1246 |
nombre_comercial
Text
| Distinct | 427 |
|---|---|
| Distinct (%) | 42.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.4 KiB |
Length
| Max length | 40 |
|---|---|
| Median length | 29 |
| Mean length | 10.473 |
| Min length | 5 |
Unique
| Unique | 107 ? |
|---|---|
| Unique (%) | 10.7% |
Sample
| 1st row | Dormicum |
|---|---|
| 2nd row | Valcote |
| 3rd row | Valcote |
| 4th row | Fluoxetina |
| 5th row | Alcaine |
| Value | Count | Frequency (%) |
| de | 35 | 2.6% |
| synthroid | 16 | 1.2% |
| atorvastatina | 12 | 0.9% |
| 2 | 12 | 0.9% |
| cloruro | 10 | 0.7% |
| seretide | 10 | 0.7% |
| 10 | 0.7% | |
| crema | 10 | 0.7% |
| plus | 10 | 0.7% |
| clotrimazol | 10 | 0.7% |
| Other values (481) | 1206 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1066 | 10.2% |
| o | 967 | 9.2% |
| i | 955 | 9.1% |
| e | 752 | 7.2% |
| r | 661 | 6.3% |
| n | 651 | 6.2% |
| l | 626 | 6.0% |
| t | 548 | 5.2% |
| 341 | 3.3% | |
| c | 317 | 3.0% |
| Other values (55) | 3589 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10473 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1066 | 10.2% |
| o | 967 | 9.2% |
| i | 955 | 9.1% |
| e | 752 | 7.2% |
| r | 661 | 6.3% |
| n | 651 | 6.2% |
| l | 626 | 6.0% |
| t | 548 | 5.2% |
| 341 | 3.3% | |
| c | 317 | 3.0% |
| Other values (55) | 3589 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10473 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1066 | 10.2% |
| o | 967 | 9.2% |
| i | 955 | 9.1% |
| e | 752 | 7.2% |
| r | 661 | 6.3% |
| n | 651 | 6.2% |
| l | 626 | 6.0% |
| t | 548 | 5.2% |
| 341 | 3.3% | |
| c | 317 | 3.0% |
| Other values (55) | 3589 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10473 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1066 | 10.2% |
| o | 967 | 9.2% |
| i | 955 | 9.1% |
| e | 752 | 7.2% |
| r | 661 | 6.3% |
| n | 651 | 6.2% |
| l | 626 | 6.0% |
| t | 548 | 5.2% |
| 341 | 3.3% | |
| c | 317 | 3.0% |
| Other values (55) | 3589 |
fabricante
Text
| Distinct | 124 |
|---|---|
| Distinct (%) | 12.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.2 KiB |
Length
| Max length | 33 |
|---|---|
| Median length | 22 |
| Mean length | 9.01 |
| Min length | 2 |
Unique
| Unique | 15 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Siegfried |
|---|---|
| 2nd row | Lafrancol |
| 3rd row | Lafrancol |
| 4th row | Genfar |
| 5th row | Alcon |
| Value | Count | Frequency (%) |
| genfar | 68 | 5.6% |
| tecnoquimicas | 54 | 4.4% |
| pfizer | 42 | 3.4% |
| lafrancol | 42 | 3.4% |
| sanofi | 33 | 2.7% |
| aventis | 33 | 2.7% |
| procaps | 33 | 2.7% |
| glaxosmithkline | 31 | 2.5% |
| merck | 29 | 2.4% |
| la | 24 | 2.0% |
| Other values (145) | 829 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 979 | 10.9% |
| e | 901 | 10.0% |
| r | 707 | 7.8% |
| i | 660 | 7.3% |
| n | 659 | 7.3% |
| o | 495 | 5.5% |
| s | 465 | 5.2% |
| c | 407 | 4.5% |
| f | 319 | 3.5% |
| l | 266 | 3.0% |
| Other values (43) | 3152 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9010 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 979 | 10.9% |
| e | 901 | 10.0% |
| r | 707 | 7.8% |
| i | 660 | 7.3% |
| n | 659 | 7.3% |
| o | 495 | 5.5% |
| s | 465 | 5.2% |
| c | 407 | 4.5% |
| f | 319 | 3.5% |
| l | 266 | 3.0% |
| Other values (43) | 3152 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9010 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 979 | 10.9% |
| e | 901 | 10.0% |
| r | 707 | 7.8% |
| i | 660 | 7.3% |
| n | 659 | 7.3% |
| o | 495 | 5.5% |
| s | 465 | 5.2% |
| c | 407 | 4.5% |
| f | 319 | 3.5% |
| l | 266 | 3.0% |
| Other values (43) | 3152 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9010 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 979 | 10.9% |
| e | 901 | 10.0% |
| r | 707 | 7.8% |
| i | 660 | 7.3% |
| n | 659 | 7.3% |
| o | 495 | 5.5% |
| s | 465 | 5.2% |
| c | 407 | 4.5% |
| f | 319 | 3.5% |
| l | 266 | 3.0% |
| Other values (43) | 3152 |
medicamento
Text
| Distinct | 607 |
|---|---|
| Distinct (%) | 60.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.1 KiB |
Length
| Max length | 160 |
|---|---|
| Median length | 119 |
| Mean length | 77.425 |
| Min length | 53 |
Unique
| Unique | 214 ? |
|---|---|
| Unique (%) | 21.4% |
Sample
| 1st row | Dormicum (Siegfried) - Ampolla 3 ml - Cada 3 ml contiene: Midazolam 15 mg |
|---|---|
| 2nd row | Valcote (Lafrancol) - Cada Tableta contiene: Divalproato Sodico 500 mg |
| 3rd row | Valcote (Lafrancol) - Cada Tableta contiene: Divalproato Sodico 500 mg |
| 4th row | Fluoxetina (Genfar) - Cada Capsula contiene: Fluoxetina 20 mg |
| 5th row | Alcaine (Alcon) - Frasco 15 ml - Cada 1 ml contiene: Proximetacaina 5 mg |
| Value | Count | Frequency (%) |
| 1626 | 12.7% | |
| cada | 1000 | 7.8% |
| contiene | 1000 | 7.8% |
| mg | 842 | 6.6% |
| ml | 599 | 4.7% |
| tableta | 411 | 3.2% |
| g | 396 | 3.1% |
| 1 | 268 | 2.1% |
| 100 | 243 | 1.9% |
| frasco | 162 | 1.3% |
| Other values (1029) | 6222 |
Most occurring characters
| Value | Count | Frequency (%) |
| 11771 | ||
| a | 7348 | 9.5% |
| e | 5166 | 6.7% |
| o | 4645 | 6.0% |
| i | 4548 | 5.9% |
| n | 4531 | 5.9% |
| l | 3256 | 4.2% |
| t | 3120 | 4.0% |
| m | 2639 | 3.4% |
| c | 2518 | 3.3% |
| Other values (70) | 27883 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 77425 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 11771 | ||
| a | 7348 | 9.5% |
| e | 5166 | 6.7% |
| o | 4645 | 6.0% |
| i | 4548 | 5.9% |
| n | 4531 | 5.9% |
| l | 3256 | 4.2% |
| t | 3120 | 4.0% |
| m | 2639 | 3.4% |
| c | 2518 | 3.3% |
| Other values (70) | 27883 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 77425 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 11771 | ||
| a | 7348 | 9.5% |
| e | 5166 | 6.7% |
| o | 4645 | 6.0% |
| i | 4548 | 5.9% |
| n | 4531 | 5.9% |
| l | 3256 | 4.2% |
| t | 3120 | 4.0% |
| m | 2639 | 3.4% |
| c | 2518 | 3.3% |
| Other values (70) | 27883 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 77425 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 11771 | ||
| a | 7348 | 9.5% |
| e | 5166 | 6.7% |
| o | 4645 | 6.0% |
| i | 4548 | 5.9% |
| n | 4531 | 5.9% |
| l | 3256 | 4.2% |
| t | 3120 | 4.0% |
| m | 2639 | 3.4% |
| c | 2518 | 3.3% |
| Other values (70) | 27883 |
canal
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.4 KiB |
| Comercial | |
|---|---|
| Institucional |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 10.888 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Institucional |
|---|---|
| 2nd row | Comercial |
| 3rd row | Institucional |
| 4th row | Comercial |
| 5th row | Comercial |
Common Values
| Value | Count | Frequency (%) |
| Comercial | 528 | |
| Institucional | 472 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| comercial | 528 | |
| institucional | 472 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1472 | |
| o | 1000 | |
| c | 1000 | |
| a | 1000 | |
| l | 1000 | |
| n | 944 | |
| t | 944 | |
| C | 528 | 4.8% |
| m | 528 | 4.8% |
| e | 528 | 4.8% |
| Other values (4) | 1944 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10888 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 1472 | |
| o | 1000 | |
| c | 1000 | |
| a | 1000 | |
| l | 1000 | |
| n | 944 | |
| t | 944 | |
| C | 528 | 4.8% |
| m | 528 | 4.8% |
| e | 528 | 4.8% |
| Other values (4) | 1944 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10888 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 1472 | |
| o | 1000 | |
| c | 1000 | |
| a | 1000 | |
| l | 1000 | |
| n | 944 | |
| t | 944 | |
| C | 528 | 4.8% |
| m | 528 | 4.8% |
| e | 528 | 4.8% |
| Other values (4) | 1944 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10888 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 1472 | |
| o | 1000 | |
| c | 1000 | |
| a | 1000 | |
| l | 1000 | |
| n | 944 | |
| t | 944 | |
| C | 528 | 4.8% |
| m | 528 | 4.8% |
| e | 528 | 4.8% |
| Other values (4) | 1944 |
precio_por_tableta
Real number (ℝ)
| Distinct | 979 |
|---|---|
| Distinct (%) | 97.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42502.925 |
| Minimum | 0.41557789 |
|---|---|
| Maximum | 8096666.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0.41557789 |
|---|---|
| 5-th percentile | 93.195876 |
| Q1 | 732.4054 |
| median | 2810.0284 |
| Q3 | 13487.242 |
| 95-th percentile | 99797.148 |
| Maximum | 8096666.7 |
| Range | 8096666.3 |
| Interquartile range (IQR) | 12754.837 |
Descriptive statistics
| Standard deviation | 312805.82 |
|---|---|
| Coefficient of variation (CV) | 7.3596304 |
| Kurtosis | 464.06461 |
| Mean | 42502.925 |
| Median Absolute Deviation (MAD) | 2569.9023 |
| Skewness | 19.693376 |
| Sum | 42502925 |
| Variance | 9.7847482 × 1010 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1750 | 3 | 0.3% |
| 1475227 | 2 | 0.2% |
| 4648.666667 | 2 | 0.2% |
| 397348 | 2 | 0.2% |
| 5656 | 2 | 0.2% |
| 26504 | 2 | 0.2% |
| 250000 | 2 | 0.2% |
| 6220 | 2 | 0.2% |
| 306.4545455 | 2 | 0.2% |
| 6380 | 2 | 0.2% |
| Other values (969) | 979 |
| Value | Count | Frequency (%) |
| 0.4155778894 | 1 | |
| 0.6419291045 | 1 | |
| 2.8 | 1 | |
| 2.864129176 | 1 | |
| 2.933333333 | 1 | |
| 4.035476839 | 1 | |
| 4.236489726 | 1 | |
| 4.579038462 | 1 | |
| 6.343015873 | 1 | |
| 6.973516949 | 1 |
| Value | Count | Frequency (%) |
| 8096666.667 | 1 | |
| 3560621 | 1 | |
| 2856140.561 | 1 | |
| 1614836.99 | 1 | |
| 1475227 | 2 | |
| 1014406.686 | 1 | |
| 820899.284 | 1 | |
| 763672.5779 | 1 | |
| 703654 | 1 | |
| 626717.0858 | 1 |
factoresprecio
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.2 KiB |
| Medio | |
|---|---|
| Alto | |
| Bajo |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.55 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alto |
|---|---|
| 2nd row | Medio |
| 3rd row | Medio |
| 4th row | Medio |
| 5th row | Medio |
Common Values
| Value | Count | Frequency (%) |
| Medio | 550 | |
| Alto | 265 | |
| Bajo | 185 | 18.5% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| medio | 550 | |
| alto | 265 | |
| bajo | 185 | 18.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1000 | |
| M | 550 | |
| e | 550 | |
| d | 550 | |
| i | 550 | |
| A | 265 | 5.8% |
| l | 265 | 5.8% |
| t | 265 | 5.8% |
| B | 185 | 4.1% |
| a | 185 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4550 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1000 | |
| M | 550 | |
| e | 550 | |
| d | 550 | |
| i | 550 | |
| A | 265 | 5.8% |
| l | 265 | 5.8% |
| t | 265 | 5.8% |
| B | 185 | 4.1% |
| a | 185 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4550 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1000 | |
| M | 550 | |
| e | 550 | |
| d | 550 | |
| i | 550 | |
| A | 265 | 5.8% |
| l | 265 | 5.8% |
| t | 265 | 5.8% |
| B | 185 | 4.1% |
| a | 185 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4550 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1000 | |
| M | 550 | |
| e | 550 | |
| d | 550 | |
| i | 550 | |
| A | 265 | 5.8% |
| l | 265 | 5.8% |
| t | 265 | 5.8% |
| B | 185 | 4.1% |
| a | 185 | 4.1% |
numerofactor
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 56.8 KiB |
| 2 | |
|---|---|
| 3 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 550 | |
| 3 | 265 | |
| 1 | 185 | 18.5% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 550 | |
| 3 | 265 | |
| 1 | 185 | 18.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 550 | |
| 3 | 265 | |
| 1 | 185 | 18.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 550 | |
| 3 | 265 | |
| 1 | 185 | 18.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 550 | |
| 3 | 265 | |
| 1 | 185 | 18.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 550 | |
| 3 | 265 | |
| 1 | 185 | 18.5% |
Interactions
Correlations
| canal | expediente_invima | factoresprecio | numerofactor | precio_por_tableta | unidad_base | unidad_de_dispensacion | |
|---|---|---|---|---|---|---|---|
| canal | 1.000 | 0.000 | 0.011 | 0.011 | 0.000 | 0.000 | 0.070 |
| expediente_invima | 0.000 | 1.000 | 0.376 | 0.376 | -0.001 | 0.671 | 0.638 |
| factoresprecio | 0.011 | 0.376 | 1.000 | 1.000 | 0.064 | 0.069 | 0.114 |
| numerofactor | 0.011 | 0.376 | 1.000 | 1.000 | 0.064 | 0.069 | 0.114 |
| precio_por_tableta | 0.000 | -0.001 | 0.064 | 0.064 | 1.000 | 0.143 | 0.144 |
| unidad_base | 0.000 | 0.671 | 0.069 | 0.069 | 0.143 | 1.000 | 0.626 |
| unidad_de_dispensacion | 0.070 | 0.638 | 0.114 | 0.114 | 0.144 | 0.626 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
| expediente_invima | principio_activo | concentracion | unidad_base | unidad_de_dispensacion | nombre_comercial | fabricante | medicamento | canal | precio_por_tableta | factoresprecio | numerofactor | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 103795 | Midazolam | Midazolam 15 mg | ml | Ampolla | Dormicum | Siegfried | Dormicum (Siegfried) - Ampolla 3 ml - Cada 3 ml contiene: Midazolam 15 mg | Institucional | 11199.8 | Alto | 3 |
| 1 | 104739 | Acido Valproico | Divalproato Sodico 500 mg | mg | Tableta | Valcote | Lafrancol | Valcote (Lafrancol) - Cada Tableta contiene: Divalproato Sodico 500 mg | Comercial | 3752.866667 | Medio | 2 |
| 2 | 104739 | Acido Valproico | Divalproato Sodico 500 mg | mg | Tableta | Valcote | Lafrancol | Valcote (Lafrancol) - Cada Tableta contiene: Divalproato Sodico 500 mg | Institucional | 1777.266522 | Medio | 2 |
| 3 | 10815 | Fluoxetina | Fluoxetina 20 mg | mg | Capsula | Fluoxetina | Genfar | Fluoxetina (Genfar) - Cada Capsula contiene: Fluoxetina 20 mg | Comercial | 329.295281 | Medio | 2 |
| 4 | 111057 | Proximetacaina | Proximetacaina 5 mg | ml | Frasco | Alcaine | Alcon | Alcaine (Alcon) - Frasco 15 ml - Cada 1 ml contiene: Proximetacaina 5 mg | Comercial | 64184.74576 | Medio | 2 |
| 5 | 111057 | Proximetacaina | Proximetacaina 5 mg | ml | Frasco | Alcaine | Alcon | Alcaine (Alcon) - Frasco 15 ml - Cada 1 ml contiene: Proximetacaina 5 mg | Institucional | 45600 | Medio | 2 |
| 6 | 113757 | Immunoglobulina Antitimocito | Immunoglobulina Antitimocito 25 mg | mg | Vial | Timoglobulina | Genzyme | Timoglobulina (Genzyme) - Cada Vial contiene: Immunoglobulina Antitimocito 25 mg | Institucional | 626717.0858 | Medio | 2 |
| 7 | 11415 | Alopurinol | Alopurinol 300 mg | mg | Tableta | Alopurinol | Memphis | Alopurinol (Memphis) - Cada Tableta contiene: Alopurinol 300 mg | Comercial | 365.3996782 | Bajo | 1 |
| 8 | 11416 | Haloperidol | Haloperidol 10 mg | mg | Tableta | Haloperidol | Memphis | Haloperidol (Memphis) - Cada Tableta contiene: Haloperidol 10 mg | Comercial | 544.4616667 | Alto | 3 |
| 9 | 11416 | Haloperidol | Haloperidol 10 mg | mg | Tableta | Haloperidol | Memphis | Haloperidol (Memphis) - Cada Tableta contiene: Haloperidol 10 mg | Institucional | 2.933333333 | Bajo | 1 |
| expediente_invima | principio_activo | concentracion | unidad_base | unidad_de_dispensacion | nombre_comercial | fabricante | medicamento | canal | precio_por_tableta | factoresprecio | numerofactor | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 990 | 19932135 | Betametasona + Neomicina + Clotrimazol | Neomicina 0,5 g + Betametasona 0,04 mg + Clotrimazol 1 g | g | Tubo | Betametasona | Genfar | Betametasona (Genfar) - Tubo 40 g - Cada 100 g contiene: Neomicina 0,5 g + Betametasona 0,04 mg + Clotrimazol 1 g | Institucional | 2935.287613 | Medio | 2 |
| 991 | 19932152 | Ciclosporina | Ciclosporina 1 mg | ml | Frasco | Modusik | Sophia | Modusik (Sophia) - Frasco 5 ml - Cada 1 ml contiene: Ciclosporina 1 mg | Comercial | 166678.2419 | Medio | 2 |
| 992 | 19932152 | Ciclosporina | Ciclosporina 1 mg | ml | Frasco | Modusik | Sophia | Modusik (Sophia) - Frasco 5 ml - Cada 1 ml contiene: Ciclosporina 1 mg | Institucional | 128830.965 | Medio | 2 |
| 993 | 19932170 | Tenecteplasa | Tenecteplasa 50 mg | mg | Vial | Metalyse | Boehringer | Metalyse (Boehringer) - Cada Vial contiene: Tenecteplasa 50 mg | Institucional | 2856140.561 | Medio | 2 |
| 994 | 19932174 | Metoclopramida | Metoclopramida 10 mg | mg | Tableta | Plasil | Bussié | Plasil (Bussié) - Cada Tableta contiene: Metoclopramida 10 mg | Comercial | 1729.481777 | Alto | 3 |
| 995 | 19932174 | Metoclopramida | Metoclopramida 10 mg | mg | Tableta | Plasil | Bussié | Plasil (Bussié) - Cada Tableta contiene: Metoclopramida 10 mg | Institucional | 251.8541667 | Alto | 3 |
| 996 | 19932247 | Galantamina | Galantamina 400 mg | ml | Frasco | Reminyl | Janssen | Reminyl (Janssen) - Frasco 100 ml - Cada 100 ml contiene: Galantamina 400 mg | Comercial | 472437.0579 | Medio | 2 |
| 997 | 19932247 | Galantamina | Galantamina 400 mg | ml | Frasco | Reminyl | Janssen | Reminyl (Janssen) - Frasco 100 ml - Cada 100 ml contiene: Galantamina 400 mg | Institucional | 455593.287 | Medio | 2 |
| 998 | 19932353 | Cefuroxima | Cefuroxima 500 mg | mg | Tableta | Cefuroxima | Genfar | Cefuroxima (Genfar) - Cada Tableta contiene: Cefuroxima 500 mg | Comercial | 3663.509646 | Medio | 2 |
| 999 | 19932353 | Cefuroxima | Cefuroxima 500 mg | mg | Tableta | Cefuroxima | Genfar | Cefuroxima (Genfar) - Cada Tableta contiene: Cefuroxima 500 mg | Institucional | 2354 | Bajo | 1 |