scatterplot with correlation coefficient , p-value

October 5, 2022October 5, 2022 abgoswam visualization

Single

from scipy import stats

result_0928 = stats.pearsonr(df_0928["dt_1"], y=df_0928["prev_deltaMs"])
print("coef: {0}".format(result_0928.statistic))
print("p-value: {0}".format(result_0928.pvalue))

import matplotlib.pyplot as plt

plt.scatter(df_0928["dt_1"], 
            df_0928["prev_deltaMs"], 
            c ="blue",
            label="pearson coef:{0} p-value:{1}".format(round(result_0928.statistic, 3), round(result_0928.pvalue,8)))
            
plt.xlabel("dt_1")
plt.ylabel("prev_deltaMs")
plt.title("09/28.  pearson coef:{0} p-value:{1}".format(
    round(result_0928.statistic, 3), 
    round(result_0928.pvalue, 7)))

# plt.legend()
plt.show()

Subplots

fig, ax = plt.subplots(nrows=1, ncols=2, figsize=(12, 4))

ax[0].scatter(df_0928["dt_1"], df_0928["prev_deltaMs"])
ax[0].title.set_text("09/28.  pearson coef:{0} p-value:{1}".format(
    round(result_0928.statistic, 3), 
    round(result_0928.pvalue, 15)))
ax[0].set(xlabel='dt_1', ylabel='prev_deltaMs')
    

ax[1].scatter(df_0928["dt_1"], df_0928["prev_deltaMs"])
ax[1].title.set_text("09/28.  pearson coef:{0} p-value:{1}".format(
    round(result_0928.statistic, 3), 
    round(result_0928.pvalue,15)))
ax[1].set(xlabel='dt_1', ylabel='prev_deltaMs')
    
fig.subplots_adjust(wspace=.4)    
plt.show()

Histogram matplotlib

October 2, 2022October 2, 2022 abgoswam visualization

see the counts (accept_n) and edge boundaries (accept_bins) when plotting histograms in matplotlib

N=50
M=max(data_accept.dt_1)
print(M)
accept_n, accept_bins, _ = plt.hist(data_accept.dt_1, N, range=[0, N], label='accept') 
reject_n, reject_bins, _ = plt.hist(data_reject.dt_1, N, range=[0, N], label='reject')
print(accept_n)
print(accept_bins)
print(reject_n)
print(reject_bins)
plt.legend()
plt.show()

References

ASCII, Unicode, UTF-8 hell

December 9, 2020 abgoswam python

These two articles helped clear up a lot of my confusion:

Ubuntu Docker + python3

November 4, 2020February 27, 2021 abgoswam python

I recently had to do a quick test of using python with ubuntu. I decided to use docker.

steps:

sudo docker run -it ubuntu bash

apt-get update
apt-get install python3-pip

# python3 --version
Python 3.8.5

to load up other stuff

sudo docker run -it -v $HOME:/work pytorch/pytorch:1.6.0-cuda10.1-cudnn7-devel bash

sudo docker run -it --ipc=host --rm -v $HOME:/work --privileged pytorch/pytorch:1.6.0-cuda10.1-cudnn7-devel bash

sudo docker run -it -v $HOME:/work py37_pytorch16_dte bash

sudo docker run -it -v $HOME:/work py37_trch16_trfmr43 bash

Beyond Integer indexing

June 27, 2020 abgoswam numpy, python

Faced an interesting problem recently

a : (B, S, T)
b : (B, C)  where 0 <= x[i, j] < S

What I want is an array of shape (B, C, T)

a = np.array(
   ...:    [[[0,1,2,3], 
   ...:      [4,5,6,7],
   ...:      [8,9,10,11]],
   ...:     [[0,1,2,3],
   ...:      [4,5,6,7],
   ...:      [8,9,10,11]]])

b = np.array(
   ...:    [[0,2,2],
   ...:     [1,0, 2]])

a.shape
Out[79]: (2, 3, 4)

b.shape
Out[80]: (2, 3)

What I expect is this

array([[[ 0,  1,  2,  3],
        [ 8,  9, 10, 11],
        [ 8,  9, 10, 11]],
       [[ 4,  5,  6,  7],
        [ 0,  1,  2,  3],
        [ 8,  9, 10, 11]]])

Note this is different from the typical scenario

Initially I hit some issues with integer index broadcasting. It seems it is possible to do it.

a[np.array([np.arange(2)]).T, b]

References:

PyTest live logging in PyCharm

June 5, 2020 abgoswam Uncategorized

PyTest does allow output to be ‘live printed’

Also, it possible to see logging output in PyTest

Checkout these two links:

Updating the transformers package

June 2, 2020 abgoswam Uncategorized

Few steps I follow each time i update the transformers package

git pull
pip install --upgrade .
pip install -r ./examples/requirements.txt

that’s

Latex multirow and column

May 29, 2020 abgoswam devproductivity

Short and intuitive example

https://texblog.org/2012/12/21/multi-column-and-multi-row-cells-in-latex-tables/

Numpy RuntimeWarning

May 11, 2020 abgoswam python

Something I learns recently..

NumPy has its own internal warning architecture on top of Pythons, which can be specifically controlled
So, something Numpy will just produce a RuntimeWarning without actually throwing an exception

Consider this:

probs = np.array([0.0, 1.0])
np.prod(probs)**(-1/len(probs))

Numpy produces a RuntimeWarning, not an exception

References:

Gradient accumulation in PyTorch

May 10, 2020 abgoswam pytorch

Need to understand:

abgoswam's tech blog

Data Science, Machine Learning, CS Theory, Systems & Web