Channel: Is there a way to make this Python kNN function more efficient? - Stack Overflow

Is there a way to make this Python kNN function more efficient?


After having trouble with MATLAB I decided to try Python.

I wrote a function that computes kNN when the samples are instances of my own class, using my own distance function:

def closestK(sample, otherSamples, distFunc, k):
    "Returns the closest k samples to sample based on distFunc"
    n = len(otherSamples)
    d = [distFunc(sample, otherSamples[i]) for i in range(n)]
    idx = sorted(range(len(d)), key=lambda j: d[j])
    return idx[1:(k + 1)]

def kNN(samples, distFunc, k):
    return [[closestK(samples[i], samples, distFunc, k)] for i in range(len(samples))]
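To make the expected behaviour concrete, here is a toy run of the same logic on plain 1-D vectors (the sample values are invented for illustration; note that index 0 of the sorted list is skipped because the sample itself is in the candidate list with distance 0):

```python
import numpy as np

def closestK(sample, otherSamples, distFunc, k):
    # Distances from `sample` to every candidate, then indices sorted by distance.
    d = [distFunc(sample, s) for s in otherSamples]
    idx = sorted(range(len(d)), key=lambda j: d[j])
    # Skip index 0: the nearest hit is the sample itself (distance 0).
    return idx[1:k + 1]

samples = [np.array([0.0]), np.array([1.0]), np.array([3.0]), np.array([7.0])]
dist = lambda a, b: np.linalg.norm(a - b)

print(closestK(samples[0], samples, dist, 2))  # [1, 2]
```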

and this is the distance function:

@staticmethod
def distanceRepr(c1, c2):
    r1 = c1.repr
    r2 = c2.repr
    # because cdist needs 2D arrays
    if r1.ndim == 1:
        r1 = np.vstack([r1, r1])
    if r2.ndim == 1:
        r2 = np.vstack([r2, r2])
    return scipy.spatial.distance.cdist(r1, r2, 'euclidean').min()
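For scale: when every repr holds a single vector, the whole neighbour search collapses into one pairwise-distance computation instead of n² Python-level calls. A minimal sketch, assuming X stacks all the repr rows (the name kNN_vectorized is invented here; cdist(X, X) would do the same job as the broadcasted expression):

```python
import numpy as np

def kNN_vectorized(X, k):
    # All pairwise Euclidean distances in one broadcasted operation,
    # equivalent to scipy.spatial.distance.cdist(X, X, 'euclidean').
    diff = X[:, None, :] - X[None, :, :]
    D = np.sqrt((diff ** 2).sum(axis=2))       # shape (n, n)
    # Sort each row; column 0 is the sample itself (distance 0), so skip it.
    return np.argsort(D, axis=1)[:, 1:k + 1]

X = np.array([[0.0], [1.0], [3.0], [7.0]])
print(kNN_vectorized(X, 2))  # rows: [1 2], [0 2], [1 0], [2 1]
```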

But it still runs far slower than the "normal" kNN function, even when using the "brute" algorithm. Am I doing something wrong?

UPDATE

I'm adding the constructor of the class. The attribute repr holds a set of vectors (one or more), and the distance between two clusters is defined as the minimal Euclidean distance between their repr sets.

class myCluster:
    def __init__(self, index=-1, P=np.array([])):
        if index == -1:
            self.repr = np.array([])
            self.IDs = np.array([])
            self.n = 0
            self.center = np.array([])
        else:
            self.repr = np.array(P)
            self.IDs = np.array(index)
            self.n = 1
            self.center = np.array(P)

and the rest of relevant code (X is a matrix whose rows are samples and columns are variables):

level = [myCluster(i, X[i, :]) for i in range(n)]
kNN(level, myCluster.distanceRepr, 3)

UPDATE 2

I've made some measurements and the line that takes most of the time is

d = [distFunc(sample, otherSamples[i]) for i in range(0,n)]

So the problem is in distFunc. When I change it to return

np.linalg.norm(c1.repr-c2.repr)

i.e. a plain vector norm with no vstack or cdist, the running time stays the same. So the bottleneck lies in the calling of this function. Does it make sense that the use of classes changes the running time by a factor of 60?
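It can: each of the n² calls pays for Python function-call dispatch, attribute lookups, the ndim checks, vstack allocations, and a cdist on a tiny array, and none of that amortises. A hedged micro-benchmark sketch comparing per-pair calls against one broadcasted computation (sizes and timings are illustrative and machine-dependent):

```python
import timeit
import numpy as np

X = np.random.rand(200, 10)

def per_pair():
    # n^2 Python-level calls, one tiny norm each.
    return [[np.linalg.norm(X[i] - X[j]) for j in range(len(X))]
            for i in range(len(X))]

def vectorized():
    # One broadcasted computation for all pairs at once.
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2))

t1 = timeit.timeit(per_pair, number=3)
t2 = timeit.timeit(vectorized, number=3)
print(f"per-pair: {t1:.3f}s  vectorized: {t2:.3f}s")
```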

