发帖

【优惠升级】华秋PCB首单最高立减100元，SMT免费贴片！！！

[资料]

第9章 BasicMathFunctions的使用（二）

2016-9-22 13:12:02 5475 源代码

0 转dsp系列教程本期教程主要讲基本函数中的相反数，偏移，位移，减法和比例因子。 9.1 相反数（Vector Negate） 9.2 求和（Vector Offset） 9.3 点乘（Vector Shift） 9.4 减法（Vector Sub） 9.5 比例因子（Vector Scale） 9.6 BasicMathFunctions的重要说明 9.7 总结 9.1 相反数（Vector Negate）这部分函数主要用于求相反数，公式描述如下： pDst[n] = -pSrc[n], 0 <= n < blockSize. 特别注意，这部分函数支持目标指针和源指针指向相同的缓冲区。 9.1.1 arm_negate_f32 这个函数用于求32位浮点数的相反数，源代码分析如下： [url=]复制代码[/url] /** * @brief Negates the elements of a floating-point vector. * @param[in] pSrc points to the input vector @param[out] pDst points to the output vector @param[in] blockSize number of samples in the vector * @Return none. / void arm_negate_f32( float32_t pSrc, float32_t * pDst, uint32_t blockSize) { uint32_t blkCnt; /* loop counter / #ifndef ARM_MATH_CM0_FAMILY / Run the below code for Cortex-M4 and Cortex-M3 / float32_t in1, in2, in3, in4; / temporary variables / /loop Unrolling / blkCnt = blockSize >> 2u; / First part of the processing with loop unrolling. Compute 4 outputs at a time. ** a second loop below computes the remaining 1 to 3 samples. / while(blkCnt > 0u) { / read inputs from source / in1 = pSrc; in2 = (pSrc + 1); in3 = (pSrc + 2); in4 = (pSrc + 3); / negate the input / (1) in1 = -in1; in2 = -in2; in3 = -in3; in4 = -in4; / store the result to destination / pDst = in1; (pDst + 1) = in2; (pDst + 2) = in3; (pDst + 3) = in4; / update pointers to process next samples / pSrc += 4u; pDst += 4u; / Decrement the loop counter / blkCnt--; } / If the blockSize is not a multiple of 4, compute any remaining output samples here. ** No loop unrolling is used. / blkCnt = blockSize % 0x4u; #else / Run the below code for Cortex-M0 / / Initialize blkCnt with number of samples / blkCnt = blockSize; #endif / #ifndef ARM_MATH_CM0_FAMILY / while(blkCnt > 0u) { / C = -A / / Negate and then store the results in the destination buffer. / pDst++ = -pSrc++; / Decrement the loop counter */ blkCnt--; } } 1. 浮点数的相反数求解比较简单，直接在相应的变量前加上负号即可。 0
举报淘帖0 只看该作者相关推荐 • 第9章硬盘和外围设备管理 1221 • 智能控制（[刘金琨编着]第1版）--第9章神经网络控制 2801 • 第9章机器人控制MATLAB 仿真程序 3714 • 智能控制--第9章神经网络控制 1969 • 第9章停车场管理系统 2327 • 机械设计基础答案(第五版)第9章 1139 • 《精通LabVIEW程序设计》一书的课件第9章 LabVIEW在模拟电子中的应用 5328 • 【安富莱——DSP教程】第16章 ControllerFunctions的使用（二） 9376 • 【安富莱DSP教程】第9章 BasicMathFunctions的使用（二） 3371 • 【安富莱——DSP教程】第8章 BasicMathFunctions的使用（一） 8280 33条评论发表评论只看该作者

lee_st · 2016-9-23 08:00:37 22^# 9.4.3 arm_sub_q15 这个函数用于求16位定点数的减法，源代码分析如下：复制代码 /** * @brief Q15 vector subtraction. * @param[in] pSrcA points to the first input vector @param[in] pSrcB points to the second input vector @param[out] pDst points to the output vector @param[in] blockSize number of samples in each vector * @return none. * * Scaling and Overflow Behavior: * par * The function uses saturating arithmetic. * Results outside of the allowable Q15 range [0x8000 0x7FFF] will be saturated. / void arm_sub_q15( q15_t pSrcA, q15_t * pSrcB, q15_t * pDst, uint32_t blockSize) { uint32_t blkCnt; /* loop counter / #ifndef ARM_MATH_CM0_FAMILY / Run the below code for Cortex-M4 and Cortex-M3 / q31_t inA1, inA2; q31_t inB1, inB2; /loop Unrolling / blkCnt = blockSize >> 2u; / First part of the processing with loop unrolling. Compute 4 outputs at a time. ** a second loop below computes the remaining 1 to 3 samples. / while(blkCnt > 0u) { / C = A - B / / Subtract and then store the results in the destination buffer two samples at a time. / inA1 = __SIMD32(pSrcA)++; (1) inA2 = __SIMD32(pSrcA)++; inB1 = __SIMD32(pSrcB)++; inB2 = __SIMD32(pSrcB)++; __SIMD32(pDst)++ = __QSUB16(inA1, inB1); (2) __SIMD32(pDst)++ = __QSUB16(inA2, inB2); / Decrement the loop counter / blkCnt--; } / If the blockSize is not a multiple of 4, compute any remaining output samples here. ** No loop unrolling is used. / blkCnt = blockSize % 0x4u; while(blkCnt > 0u) { / C = A - B / / Subtract and then store the result in the destination buffer. / pDst++ = (q15_t) __QSUB16(pSrcA++, pSrcB++); /* Decrement the loop counter / blkCnt--; } #else / Run the below code for Cortex-M0 / / Initialize blkCnt with number of samples / blkCnt = blockSize; while(blkCnt > 0u) { / C = A - B / / Subtract and then store the result in the destination buffer. / pDst++ = (q15_t) __SSAT(((q31_t) * pSrcA++ - pSrcB++), 16); / Decrement the loop counter / blkCnt--; } #endif / #ifndef ARM_MATH_CM0_FAMILY */ } 1. 这里一次读取两个Q15格式的数据。 2. 由于__QSUB16是SIMD指令，在这里调用一次__QSUB16可以实现两次减法运算。

赞回复举报

lee_st · 2016-9-23 08:00:49 23^# 9.4.4 arm_sub_q7 这个函数用于求8位定点数的减法，源代码分析如下：复制代码 /** * @brief Q7 vector subtraction. * @param[in] pSrcA points to the first input vector @param[in] pSrcB points to the second input vector @param[out] pDst points to the output vector @param[in] blockSize number of samples in each vector * @return none. * * Scaling and Overflow Behavior: * par * The function uses saturating arithmetic. * Results outside of the allowable Q7 range [0x80 0x7F] will be saturated. / void arm_sub_q7( q7_t pSrcA, q7_t * pSrcB, q7_t * pDst, uint32_t blockSize) { uint32_t blkCnt; /* loop counter / #ifndef ARM_MATH_CM0_FAMILY / Run the below code for Cortex-M4 and Cortex-M3 / /loop Unrolling / blkCnt = blockSize >> 2u; / First part of the processing with loop unrolling. Compute 4 outputs at a time. ** a second loop below computes the remaining 1 to 3 samples. / while(blkCnt > 0u) { / C = A - B / / Subtract and then store the results in the destination buffer 4 samples at a time. / __SIMD32(pDst)++ = __QSUB8(__SIMD32(pSrcA)++, __SIMD32(pSrcB)++); (1) /* Decrement the loop counter / blkCnt--; } / If the blockSize is not a multiple of 4, compute any remaining output samples here. ** No loop unrolling is used. / blkCnt = blockSize % 0x4u; while(blkCnt > 0u) { / C = A - B / / Subtract and then store the result in the destination buffer. / pDst++ = __SSAT(pSrcA++ - pSrcB++, 8); /* Decrement the loop counter / blkCnt--; } #else / Run the below code for Cortex-M0 / / Initialize blkCnt with number of samples / blkCnt = blockSize; while(blkCnt > 0u) { / C = A - B / / Subtract and then store the result in the destination buffer. / pDst++ = (q7_t) __SSAT((q15_t) * pSrcA++ - pSrcB++, 8); / Decrement the loop counter / blkCnt--; } #endif / #ifndef ARM_MATH_CM0_FAMILY */ } 1. __QSUB8也是SIMD指令，调用一次就能实现4个Q7格式数据的减法运算。

赞回复举报

lee_st · 2016-9-23 08:01:10 24^# 9.4.5 实例讲解实验目的： 1. 四种种类型数据的减法。实验内容： 1. 按下按键UP, 串口打印输出结果实验现象：通过窗口上位机软件SecureCRT（V5光盘里面有此软件）查看打印信息现象如下：

赞回复举报

lee_st · 2016-9-23 08:01:37 25^# 程序设计：复制代码 /* ********************************************************************************************************* * 函数名: DSP_Sub * 功能说明: 减法 * 形参：无 * 返回值: 无 ********************************************************************************************************* / static void DSP_Sub(void) { static float32_t pSrcA[5] = {1.0f,1.0f,1.0f,1.0f,1.0f}; static float32_t pSrcB[5] = {1.0f,1.0f,1.0f,1.0f,1.0f}; static float32_t pDst[5]; static q31_t pSrcA1[5] = {1,1,1,1,1}; static q31_t pSrcB1[5] = {1,1,1,1,1}; static q31_t pDst1[5]; static q15_t pSrcA2[5] = {1,1,1,1,1}; static q15_t pSrcB2[5] = {1,1,1,1,1}; static q15_t pDst2[5]; static q7_t pSrcA3[5] = {0x70,1,1,1,1}; static q7_t pSrcB3[5] = {0x7f,1,1,1,1}; static q7_t pDst3[5]; pSrcA[0] += 1.1f; arm_sub_f32(pSrcA, pSrcB, pDst, 5); printf("arm_sub_f32 = %frn", pDst[0]); pSrcA1[0] += 1; arm_sub_q31(pSrcA1, pSrcB1, pDst1, 5); printf("arm_sub_q31 = %drn", pDst1[0]); pSrcA2[0] += 1; arm_sub_q15(pSrcA2, pSrcB2, pDst2, 5); printf("arm_sub_q15 = %drn", pDst2[0]); pSrcA3[0] += 1; arm_sub_q7(pSrcA3, pSrcB3, pDst3, 5); printf("arm_sub_q7 = %drn", pDst3[0]); printf("**********************************rn"); }

赞回复举报

lee_st · 2016-9-23 08:01:56 26^# 9.5 比例因子（Vector Scale）这部分函数主要用于实现数据的比例放大和缩小，浮点数据公式描述如下： pDst[n] = pSrc[n] * scale, 0 <= n < blockSize. 如果是Q31，Q15，Q7格式的数据，公式描述如下： pDst[n] = (pSrc[n] * scaleFract) << shift, 0 <= n < blockSize. 这种情况下，比例因子就是： scale = scaleFract * 2^shift. 注意，这部分函数支持目标指针和源指针指向相同的缓冲区。

赞回复举报

lee_st · 2016-9-23 08:02:16 27^# 9.5.1 arm_scale_f32 这个函数用于求32位浮点数的比例放缩，源代码分析如下：复制代码 /** * @brief Multiplies a floating-point vector by a scalar. * @param[in] pSrc points to the input vector @param[in] scale scale factor to be applied * @param[out] pDst points to the output vector @param[in] blockSize number of samples in the vector * @return none. / void arm_scale_f32( float32_t pSrc, float32_t scale, float32_t * pDst, uint32_t blockSize) { uint32_t blkCnt; /* loop counter / #ifndef ARM_MATH_CM0_FAMILY / Run the below code for Cortex-M4 and Cortex-M3 / float32_t in1, in2, in3, in4; / temporary variabels / /loop Unrolling / blkCnt = blockSize >> 2u; / First part of the processing with loop unrolling. Compute 4 outputs at a time. ** a second loop below computes the remaining 1 to 3 samples. / while(blkCnt > 0u) { / C = A * scale / / Scale the input and then store the results in the destination buffer. / / read input samples from source / in1 = pSrc; in2 = (pSrc + 1); / multiply with scaling factor / (1) in1 = in1 scale; /* read input sample from source / in3 = (pSrc + 2); /* multiply with scaling factor / in2 = in2 scale; /* read input sample from source / in4 = (pSrc + 3); /* multiply with scaling factor / in3 = in3 scale; in4 = in4 * scale; /* store the result to destination / pDst = in1; (pDst + 1) = in2; (pDst + 2) = in3; (pDst + 3) = in4; / update pointers to process next samples / pSrc += 4u; pDst += 4u; / Decrement the loop counter / blkCnt--; } / If the blockSize is not a multiple of 4, compute any remaining output samples here. ** No loop unrolling is used. / blkCnt = blockSize % 0x4u; #else / Run the below code for Cortex-M0 / / Initialize blkCnt with number of samples / blkCnt = blockSize; #endif / #ifndef ARM_MATH_CM0_FAMILY / while(blkCnt > 0u) { / C = A * scale / / Scale the input and then store the result in the destination buffer. / pDst++ = (pSrc++) scale; /* Decrement the loop counter */ blkCnt--; } } 1. 浮点数据的比例因子计算比较简单，源浮点数相应相应的比例因子即可。

赞回复举报

lee_st · 2016-9-23 08:02:30 28^# 9.5.2 arm_scale_q31 这个函数用于求32位定点数的比例放缩，源代码分析如下：复制代码 /** * @brief Multiplies a Q31 vector by a scalar. * @param[in] pSrc points to the input vector @param[in] scaleFract fractional portion of the scale value * @param[in] shift number of bits to shift the result by * @param[out] pDst points to the output vector @param[in] blockSize number of samples in the vector * @return none. * * Scaling and Overflow Behavior: (1) * par * The input data `pSrc` and `scaleFract` are in 1.31 format. These are multiplied to yield a 2.62 intermediate result and this is shifted with saturation to 1.31 format. / void arm_scale_q31( q31_t pSrc, q31_t scaleFract, int8_t shift, q31_t * pDst, uint32_t blockSize) { int8_t kShift = shift + 1; /* Shift to apply after scaling / (2) int8_t sign = (kShift & 0x80); uint32_t blkCnt; / loop counter / q31_t in, out; #ifndef ARM_MATH_CM0_FAMILY / Run the below code for Cortex-M4 and Cortex-M3 / q31_t in1, in2, in3, in4; / temporary input variables / q31_t out1, out2, out3, out4; / temporary output variabels / /loop Unrolling / blkCnt = blockSize >> 2u; if(sign == 0u) (3) { / First part of the processing with loop unrolling. Compute 4 outputs at a time. ** a second loop below computes the remaining 1 to 3 samples. / while(blkCnt > 0u) { / read four inputs from source / in1 = pSrc; in2 = (pSrc + 1); in3 = (pSrc + 2); in4 = (pSrc + 3); / multiply input with scaler value / (4) in1 = ((q63_t) in1 scaleFract) >> 32; in2 = ((q63_t) in2 * scaleFract) >> 32; in3 = ((q63_t) in3 * scaleFract) >> 32; in4 = ((q63_t) in4 * scaleFract) >> 32; /* apply shifting / out1 = in1 << kShift; out2 = in2 << kShift; / saturate the results. / if(in1 != (out1 >> kShift)) (5) out1 = 0x7FFFFFFF ^ (in1 >> 31); if(in2 != (out2 >> kShift)) out2 = 0x7FFFFFFF ^ (in2 >> 31); out3 = in3 << kShift; out4 = in4 << kShift; pDst = out1; (pDst + 1) = out2; if(in3 != (out3 >> kShift)) out3 = 0x7FFFFFFF ^ (in3 >> 31); if(in4 != (out4 >> kShift)) out4 = 0x7FFFFFFF ^ (in4 >> 31); / Store result destination / (pDst + 2) = out3; (pDst + 3) = out4; / Update pointers to process next sampels / pSrc += 4u; pDst += 4u; / Decrement the loop counter / blkCnt--; } } else { / First part of the processing with loop unrolling. Compute 4 outputs at a time. ** a second loop below computes the remaining 1 to 3 samples. / while(blkCnt > 0u) { / read four inputs from source / in1 = pSrc; in2 = (pSrc + 1); in3 = (pSrc + 2); in4 = (pSrc + 3); / multiply input with scaler value / in1 = ((q63_t) in1 scaleFract) >> 32; in2 = ((q63_t) in2 * scaleFract) >> 32; in3 = ((q63_t) in3 * scaleFract) >> 32; in4 = ((q63_t) in4 * scaleFract) >> 32; /* apply shifting / (6) out1 = in1 >> -kShift; out2 = in2 >> -kShift; out3 = in3 >> -kShift; out4 = in4 >> -kShift; / Store result destination / pDst = out1; (pDst + 1) = out2; (pDst + 2) = out3; (pDst + 3) = out4; / Update pointers to process next sampels / pSrc += 4u; pDst += 4u; / Decrement the loop counter / blkCnt--; } } / If the blockSize is not a multiple of 4, compute any remaining output samples here. ** No loop unrolling is used. / blkCnt = blockSize % 0x4u; #else / Run the below code for Cortex-M0 / / Initialize blkCnt with number of samples / blkCnt = blockSize; #endif / #ifndef ARM_MATH_CM0_FAMILY / if(sign == 0) { while(blkCnt > 0u) { / C = A * scale / / Scale the input and then store the result in the destination buffer. / in = pSrc++; in = ((q63_t) in * scaleFract) >> 32; out = in << kShift; if(in != (out >> kShift)) out = 0x7FFFFFFF ^ (in >> 31); pDst++ = out; / Decrement the loop counter / blkCnt--; } } else { while(blkCnt > 0u) { / C = A * scale / / Scale the input and then store the result in the destination buffer. / in = pSrc++; in = ((q63_t) in * scaleFract) >> 32; out = in >> -kShift; pDst++ = out; / Decrement the loop counter / blkCnt--; } } } 1. 源数据和比例因子都是Q31格式。这样他们的乘积就是1.31 1.31 = 2.62格式。由于输出结果也是Q31格式，那么源数据和比例因子的乘积需要右移32位，并且输出结果需要饱和处理。 2. 这里不清楚为什么要加1操作，留作以后解决。 3. 如果位移是正值，那么就是左移位，否则就是右移位。 4. 将源数据和比例因子的乘积左移32位，保证结果也是Q31格式。 5. 这里是对结果的饱和处理。 6. 数值的右移不存在饱和问题，这里直接取反即可。

赞回复举报

lee_st · 2016-9-23 08:02:41 29^# 9.5.3 arm_scale_q15 这个函数用于求16位定点数的比例放缩，源代码分析如下：复制代码 /** * @brief Multiplies a Q15 vector by a scalar. * @param[in] pSrc points to the input vector @param[in] scaleFract fractional portion of the scale value * @param[in] shift number of bits to shift the result by * @param[out] pDst points to the output vector @param[in] blockSize number of samples in the vector * @return none. * * Scaling and Overflow Behavior: (1) * par * The input data `pSrc` and `scaleFract` are in 1.15 format. These are multiplied to yield a 2.30 intermediate result and this is shifted with saturation to 1.15 format. / void arm_scale_q15( q15_t pSrc, q15_t scaleFract, int8_t shift, q15_t * pDst, uint32_t blockSize) { int8_t kShift = 15 - shift; /* shift to apply after scaling / (2) uint32_t blkCnt; / loop counter / #ifndef ARM_MATH_CM0_FAMILY / Run the below code for Cortex-M4 and Cortex-M3 / q15_t in1, in2, in3, in4; q31_t inA1, inA2; / Temporary variables / q31_t out1, out2, out3, out4; /loop Unrolling / blkCnt = blockSize >> 2u; / First part of the processing with loop unrolling. Compute 4 outputs at a time. ** a second loop below computes the remaining 1 to 3 samples. / while(blkCnt > 0u) { / Reading 2 inputs from memory / inA1 = __SIMD32(pSrc)++; (3) inA2 = __SIMD32(pSrc)++; / C = A * scale / / Scale the inputs and then store the 2 results in the destination buffer * in single cycle by packing the outputs / out1 = (q31_t) ((q15_t) (inA1 >> 16) scaleFract); (4) out2 = (q31_t) ((q15_t) inA1 * scaleFract); out3 = (q31_t) ((q15_t) (inA2 >> 16) * scaleFract); out4 = (q31_t) ((q15_t) inA2 * scaleFract); /* apply shifting / out1 = out1 >> kShift; out2 = out2 >> kShift; out3 = out3 >> kShift; out4 = out4 >> kShift; / saturate the output / in1 = (q15_t) (__SSAT(out1, 16)); (5) in2 = (q15_t) (__SSAT(out2, 16)); in3 = (q15_t) (__SSAT(out3, 16)); in4 = (q15_t) (__SSAT(out4, 16)); / store the result to destination / (6) __SIMD32(pDst)++ = __PKHBT(in2, in1, 16); __SIMD32(pDst)++ = __PKHBT(in4, in3, 16); / Decrement the loop counter / blkCnt--; } / If the blockSize is not a multiple of 4, compute any remaining output samples here. ** No loop unrolling is used. / blkCnt = blockSize % 0x4u; while(blkCnt > 0u) { / C = A * scale / / Scale the input and then store the result in the destination buffer. / pDst++ = (q15_t) (__SSAT(((pSrc++) scaleFract) >> kShift, 16)); /* Decrement the loop counter / blkCnt--; } #else / Run the below code for Cortex-M0 / / Initialize blkCnt with number of samples / blkCnt = blockSize; while(blkCnt > 0u) { / C = A * scale / / Scale the input and then store the result in the destination buffer. / pDst++ = (q15_t) (__SSAT(((q31_t) * pSrc++ * scaleFract) >> kShift, 16)); /* Decrement the loop counter / blkCnt--; } #endif / #ifndef ARM_MATH_CM0_FAMILY / } 1. 源数据和比例因子的数据格式都是Q15，这样的话，输出结果就是1.15 1.15 = 2.30格式，由于输出结果也是Q15格式，所以输出结果需要饱和处理。 2. 这个变量设计很巧妙，这样下面处理正数左移和负数右移就很方面了，可以直接使用一个右移就可以实现。 3. 读取两个Q15格式的数据。 4. 将源数据乘以比例因子后赋值给Q31格式的变量。 5. 对输出结果做饱和处理。 6. 通过调用一次__PKHBT指令，将两个Q15格式的数据都赋值给目的变量。

赞回复举报

lee_st · 2016-9-23 08:02:54 30^# 9.5.4 arm_scale_q7 这个函数用于求8位定点数的比例放缩，源代码分析如下：复制代码 /** * @brief Multiplies a Q7 vector by a scalar. * @param[in] pSrc points to the input vector @param[in] scaleFract fractional portion of the scale value * @param[in] shift number of bits to shift the result by * @param[out] pDst points to the output vector @param[in] blockSize number of samples in the vector * @return none. * * Scaling and Overflow Behavior: (1) * par * The input data `pSrc` and `scaleFract` are in 1.7 format. These are multiplied to yield a 2.14 intermediate result and this is shifted with saturation to 1.7 format. / void arm_scale_q7( q7_t pSrc, q7_t scaleFract, int8_t shift, q7_t * pDst, uint32_t blockSize) { int8_t kShift = 7 - shift; /* shift to apply after scaling / (2) uint32_t blkCnt; / loop counter / #ifndef ARM_MATH_CM0_FAMILY / Run the below code for Cortex-M4 and Cortex-M3 / q7_t in1, in2, in3, in4, out1, out2, out3, out4; / Temporary variables to store input & output / /loop Unrolling / blkCnt = blockSize >> 2u; / First part of the processing with loop unrolling. Compute 4 outputs at a time. ** a second loop below computes the remaining 1 to 3 samples. / while(blkCnt > 0u) { / Reading 4 inputs from memory / in1 = pSrc++; in2 = pSrc++; in3 = pSrc++; in4 = pSrc++; / C = A * scale / / Scale the inputs and then store the results in the temporary variables. / out1 = (q7_t) (__SSAT(((in1) scaleFract) >> kShift, 8)); (3) out2 = (q7_t) (__SSAT(((in2) * scaleFract) >> kShift, 8)); out3 = (q7_t) (__SSAT(((in3) * scaleFract) >> kShift, 8)); out4 = (q7_t) (__SSAT(((in4) * scaleFract) >> kShift, 8)); /* Packing the individual outputs into 32bit and storing in * destination buffer in single write / __SIMD32(pDst)++ = __PACKq7(out1, out2, out3, out4); (4) /* Decrement the loop counter / blkCnt--; } / If the blockSize is not a multiple of 4, compute any remaining output samples here. ** No loop unrolling is used. / blkCnt = blockSize % 0x4u; while(blkCnt > 0u) { / C = A * scale / / Scale the input and then store the result in the destination buffer. / pDst++ = (q7_t) (__SSAT(((pSrc++) scaleFract) >> kShift, 8)); /* Decrement the loop counter / blkCnt--; } #else / Run the below code for Cortex-M0 / / Initialize blkCnt with number of samples / blkCnt = blockSize; while(blkCnt > 0u) { / C = A * scale / / Scale the input and then store the result in the destination buffer. / pDst++ = (q7_t) (__SSAT((((q15_t) * pSrc++ * scaleFract) >> kShift), 8)); /* Decrement the loop counter / blkCnt--; } #endif / #ifndef ARM_MATH_CM0_FAMILY / } 1. 源数据和比例因子的数据格式都是Q7，这样的话，输出结果就是1.7 1.7 = 2.14格式，由于输出结果也是Q7格式，所以输出结果需要饱和处理。 2. 这个变量设计很巧妙，这样下面处理正数左移和负数右移就很方面了，可以直接使用一个右移就可以实现。 3. 对源数据和比例因子的输出结果做8位精度的饱和处理。

赞回复举报

lee_st · 2016-9-23 08:03:53 31^# 9.5.5 实例讲解实验目的： 1. 四种种类型数据的比例放缩。实验内容： 1. 按下按键DOWN 串口打印输出结果实验现象：通过窗口上位机软件SecureCRT（V5光盘里面有此软件）查看打印信息现象如下：

赞回复举报

lee_st · 2016-9-23 08:04:13 32^# 程序设计：复制代码 /* ********************************************************************************************************* * 函数名: DSP_Scale * 功能说明: 比例因子 * 形参：无 * 返回值: 无 ********************************************************************************************************* / static void DSP_Scale(void) { static float32_t pSrcA[5] = {1.0f,1.0f,1.0f,1.0f,1.0f}; static float32_t scale = 0.0f; static float32_t pDst[5]; static q31_t pSrcA1[5] = {0x6fffffff,1,1,1,1}; static q31_t scale1 = 0x6fffffff; static q31_t pDst1[5]; static q15_t pSrcA2[5] = {0x6fff,1,1,1,1}; static q15_t scale2 = 0x6fff; static q15_t pDst2[5]; static q7_t pSrcA3[5] = {0x70,1,1,1,1}; static q7_t scale3 = 0x6f; static q7_t pDst3[5]; scale += 0.1f; arm_scale_f32(pSrcA, scale, pDst, 5); printf("arm_sub_f32 = %frn", pDst[0]); scale1 += 1; arm_scale_q31(pSrcA1, scale1, 0, pDst1, 5); printf("arm_scale_q31 = %xrn", pDst1[0]); scale2 += 1; arm_scale_q15(pSrcA2, scale2, 0, pDst2, 5); printf("arm_scale_q15 = %xrn", pDst2[0]); scale3 += 1; arm_scale_q7(pSrcA3, scale3, 0, pDst3, 5); printf("arm_scale_q7 = %xrn", pDst3[0]); printf("**********************************rn"); }

赞回复举报

lee_st · 2016-9-23 08:04:43 33^# 9.6 BasicMathFunctions的重要说明截至到这里，BasicMathFunctions函数已经讲解完了，也许大家也发现了这些函数的一些共同点，在前面第8章的时候我们简单的阐述过，这里再进一步的阐述一下： l 这些函数基本都是支持重入的。 l 基本每个函数都有四种数据类型，F32，Q31，Q15，Q7。 l 函数中数值的处理基本都是4个为一组，这么做的原因是F32，Q31，Q15，Q7就可以统一采用一个程序设计架构，便于管理。更重要的是可以在Q15和Q7数据处理中很好的发挥SIMD指令的作用（因为4个为一组的话，可以用SIMD指令正好处理2个Q15数据或者4个Q7数据）。 l 部分函数是支持目标指针和源指针指向相同的缓冲区。关于这个的使用，我们没有在前面的讲解中举例子，下面举一个简单的例子进行说明，这里就以9.5小节中scale函数进行说明：复制代码 static void DSP_Scale(void) { static float32_t pSrcA[5] = {1.0f,1.0f,1.0f,1.0f,1.0f}; static float32_t scale = 0.0f; static q31_t pSrcA1[5] = {0x6fffffff,1,1,1,1}; static q31_t scale1 = 0x6fffffff; static q15_t pSrcA2[5] = {0x6fff,1,1,1,1}; static q15_t scale2 = 0x6fff; static q7_t pSrcA3[5] = {0x70,1,1,1,1}; static q7_t scale3 = 0x6f; scale += 0.1f; arm_scale_f32(pSrcA, scale, pSrcA, 5); (1) printf("arm_sub_f32 = %frn", pSrcA[0]); scale1 += 1; arm_scale_q31(pSrcA1, scale1, 0, pSrcA1, 5); (2) printf("arm_scale_q31 = %xrn", pSrcA1[0]); scale2 += 1; arm_scale_q15(pSrcA2, scale2, 0, pSrcA2, 5); (3) printf("arm_scale_q15 = %xrn", pSrcA2[0]); scale3 += 1; arm_scale_q7(pSrcA3, scale3, 0, pSrcA3, 5); (4) printf("arm_scale_q7 = %xrn", pSrcA3[0]); printf("***********************************rn"); } 上面代码的（1）至（4）目标指针和源指针指向相同的缓冲区。

赞回复举报

lee_st · 2016-9-23 08:05:22 34^# 9.7 总结 BasicMathFunctions函数就跟大家讲这么多，希望初学的同学多多的联系，并在自己以后的项目中多多使用，效果必将事半功倍。

赞回复举报

lee_st · 2016-9-23 08:05:42 35^# 分享完成，，，，，，，，，，，，，，

赞回复举报

评论

声明：本文内容及配图由入驻作者撰写或者入驻合作网站授权转载。文章观点仅代表作者本人，不代表电子发烧友网立场。文章及其配图仅供工程师学习之用，如有内容图片侵权或者其他问题，请联系本站作侵删。侵权投诉

上一页 12 / 2 页

发资料

精选推荐

【敏矽微ME32G070开发板免费体验】新建工程（MDK）

386 浏览 0 评论
求助一下关于51系列单片机的Timer0的计时问题，TH0、TL0+1的时间是怎么算的？

1670 浏览 1 评论
【RA-Eco-RA4E2-64PIN-V1.0开发板试用】开箱+Keil环境搭建+点灯+点亮OLED

1123 浏览 0 评论
【敏矽微ME32G070开发板免费体验】使用coremark测试敏矽微ME32G070 跑分

1005 浏览 0 评论
【敏矽微ME32G070开发板免费体验】开箱+点灯+点亮OLED

1229 浏览 2 评论

热门帖

【youyeetoo X1 windows 开发板体验】少儿AI智能STEAM积木平台

12018 浏览 31 评论

快速回复 返回顶部 返回列表

关注微信公众号

电子发烧友网

电子发烧友论坛

社区合作: 刘勇; 联系电话：15994832713; 邮箱地址：liuyong@huaqiu.com

社区管理: elecfans短短; 微信：elecfans_666; 邮箱：users@huaqiu.com

【优惠升级】华秋PCB首单最高立减100元，SMT免费贴片！！！

返回单片机/MCU论坛

上一页 12 / 2 页

回复

关闭

站长推荐 /6

快速回复 返回顶部 返回列表

- 技术社区: HarmonyOS技术社区

RISC-V MCU技术社区

FPGA开发者技术社区

- OpenHarmony开源社区: OpenHarmony开源社区

- 嵌入式论坛: ARM技术论坛

STM32/STM8技术论坛

嵌入式技术论坛

单片机/MCU论坛

RISC-V技术论坛

瑞芯微Rockchip开发者社区

FPGA|CPLD|ASIC论坛

DSP论坛

- 电路图及DIY: 电路设计论坛

DIY及创意

电子元器件论坛

专家问答

- 电源技术论坛: 电源技术论坛

无线充电技术

- 综合技术与应用: 机器人论坛

USB论坛

电机控制

模拟技术

音视频技术

综合技术交流

上位机软件（C/Python/Java等）

- 无线通信论坛: WIFI技术

蓝牙技术

天线|RF射频|微波|雷达技术

- EDA设计论坛: PCB设计论坛

DigiPCBA论坛

Protel|AD|DXP论坛

PADS技术论坛

Allegro论坛

multisim论坛

proteus论坛|仿真论坛

KiCad EDA 中文论坛

DFM|可制造性设计论坛

- 测试测量论坛: LabVIEW论坛

Matlab论坛

测试测量技术

传感技术

- 招聘/交友/外包/交易/杂谈: 项目外包

供需及二手交易

工程师杂谈|交友

招聘|求职|工程师职场

- 官方社区: 发烧友官方/活动

华秋商城

华秋电路

time

recommend

hot

post

—
—
—

版
块
导
航